FOREWORD TO VOLUME 1

THE study of the physical properties of metals has developed through 
a number of stages. The first was that in which the mechanical 
properties were correlated empirically with the heat treatment to
which the metal had been subjected and, sometimes, with the chemical
composition. At this stage the successful treatment of metals was an
art, in the sense that experience rather than understanding led to the 
most satisfactory results. The next stage, in which the internal struc- 
ture of the metal was examined, was based originally on the use of the 
microscope and it was found that many experimental facts could be 
explained in terms of effects that were of the right size to be seen under 
magnifications of less than about two thousand. The development of 
the x-ray diffraction techniques allowed phenomena of a smaller order 
of magnitude to be examined, and much of the existing information 
was found to be comprehensible in terms of the geometry of the crystal 
structure of the various phases that were visible under the microscope. 

More recent development can perhaps best be discussed by a division 
of the field into what may be termed 'statics' and 'dynamics'. Under 
the former heading is the study of the conditions which govern the 
structure of a metal or alloy when it is in thermodynamic and mechanical
equilibrium. The theories of the phases that are present in equilibrium
and of the elastic constants have made remarkable progress in terms of
the rapidly developing theory of the part played by electrons in the metal.

Under the heading 'dynamic' effects we may include both the 
conditions governing the approach to equilibrium in respect of the 
phases that are present, in which diffusion plays an important part, 
and the response of a metal to forces which are sufficient to cause 
non-recoverable or plastic mechanical deformation. 

These and associated subjects have advanced so rapidly that it has 
become difficult for research workers in one part of the field to remain 
up to date in other branches. It is the purpose of this volume, which 
is the first of an annual series, to present authoritative reviews of the 
present state of knowledge in specialized aspects of the field that includes 
both physical metallurgy and metal physics. It is not intended that any 
one volume should form a complete textbook on these subjects. It is 
hoped rather that a few subjects of current interest should be discussed 
rather fully so as to cover, in the course of several years, all the more 
important aspects in which progress is being made. In order to make
the series reasonably self-contained it is proposed that the necessary
'historical' background should be included the first time a particular 
subject is discussed. Subsequent articles on such subjects will generally 
only cover the more recent progress. 

B. CHALMERS 
INTERNAL FRICTION IN METALS

A. S. Nowick

THE capacity of a vibrating solid to convert its mechanical energy of
vibration into heat, even when so well isolated that energy losses to its 
surroundings are negligible, is called internal friction. The most common 
manifestation of internal friction is the damping, or loss of vibration 
amplitude, of a freely vibrating body. Interest in internal friction in 
metals undoubtedly extends back to prehistoric times, to the earliest 
cymbals and bells. Modern investigation has followed two different
channels. On the one hand there has been widespread interest in
damping for engineering applications; depending on the particular
application, one may wish to obtain either a very high or a very low 
rate of damping. In work of this kind, the effect of heat treatments and 
alloying elements on the damping characteristics of engineering materials 
is investigated and commonly reported in terms of a quantity called 
"damping capacity." Vibration amplitudes employed are commen- 
surate with those that occur in practice. An adequate review of this 
phase of the subject would in itself require a sizeable article. A second 
approach to internal friction, with which the present review is con- 
cerned, is the use of damping as a tool to study internal structure and 
atomic movements in solids. This is a relatively new field of investiga- 
tion and has been developing rapidly. Interest is focused on the origin 
and mechanism of internal friction rather than its desirability, and the 
vibration amplitudes employed are very small. 

A perfectly elastic material will not produce damping since, under 
oscillatory conditions, stress and strain are always in phase, and 
consequently no mechanical hysteresis or loss of vibrational energy can 
take place. Damping is, therefore, a result of the non-elastic behaviour 
of materials. The fact that it can be observed at stress levels far below 
those at which plastic flow occurs, shows that there is really no such 
thing as an "elastic range." In recent years there has been a great 
increase in understanding of those physical processes that result in
non-elastic behaviour, and, therefore, in damping, of metals at low
stress levels. Review articles on this subject, written in the period
1939-1943, 1-3 emphasize three main sources of damping: thermo-
elasticity, ferromagnetic effects, and the internal stresses associated
with cold work, or dislocations. Of these, the last was least understood, 
in the sense that the mechanism by which these internal distortions act 
to produce damping was not established. During the war and early
post-war period intensive investigations of internal friction, particularly
by ZENER and co-workers, brought to light other sources of internal 
friction, including stress relaxation across grain boundaries and stress- 
induced ordering. These last two mechanisms together with thermo- 
elastic internal friction fitted into a scheme of non-elastic behaviour 
which ZENER 4 called anelasticity, the development of which culminated
in his book 5 in 1948. In the treatment of anelasticity it is shown that
not only internal friction but several related static effects (recoverable 
creep, stress relaxation, and elastic after-effect at low stress levels) 
result from this type of non-elastic behaviour. Only internal friction 
that does not depend on the amplitude of vibration is considered in this 
treatment. The impact of this important work has been so great as to
lead to the widespread misconception that all internal friction at low 
amplitudes can be explained in terms of anelasticity. The internal 
friction originating in cold work is an appropriate example; theories
attempting to explain observed effects in terms of the scheme of anelas-
tic phenomena 6-8 have not met with notable success. In fact, after
READ 9,10 showed that cold work internal friction in single crystals is
strongly amplitude dependent, the writer demonstrated 11,12 that this
behaviour originates in a mechanism of static hysteresis which is quite
different from anelasticity. 

In the present article, an attempt is first made to provide a pheno- 
menological description of the general features of non-elastic deforma- 
tion which is sufficiently general to include, as special cases, anelasticity 
as well as static hysteresis and amplitude dependent internal friction. 
The first three main sections deal with this phenomenological approach. 
It is shown that internal friction cannot be divorced from an equally 
important effect : the change in the effective elastic modulus resulting 
from the non-elastic behaviour, which is called "the $\Delta M$ effect".
Following the phenomenological treatment, the various experimental 
methods used to measure internal friction are reviewed. The remainder 
of the article is devoted to the physical origins of internal friction. All 
sources of damping involve an internal re-arrangement that takes place 
under stress. There are three types of re-arrangements that have been 
studied: thermal (the thermo-elastic effect), magnetic, and atomic 
re-arrangements. All of these are considered in the present article, 
although emphasis is placed upon the latter category, since most recent 
developments are in this field and it is the least adequately reviewed. 
Included in the category of atomic re-arrangements are diffusion effects, 
relaxation across interfaces, and the motion of dislocations. 

PHENOMENOLOGICAL THEORY: HOMOGENEOUS STRAIN 

In most vibration experiments the specimen under investigation is 
subjected to a non-homogeneous strain. A discussion of the elastic and
non-elastic behaviour of the specimen is, therefore, complicated by the
dependence of stress and strain on position in the specimen. In this 
section, the principles of non-elastic behaviour will be introduced under 
the assumption that we are dealing either with a small element of a 
specimen within which the strain is uniform, or with a complete speci- 
men throughout which the strain is the same at any instant. The latter 
condition can be realized, for a specimen of appropriate shape, by the
introduction of an auxiliary inertia member relative to which the 
inertia of the specimen itself is negligible. For example, it is possible to 
set up a homogeneous shear strain by torsional vibration of a hollow 
tube to which is attached a heavy inertia member. 

For simplicity of notation, it is assumed that the stress system within
the specimen can be defined by a single component $\sigma$. Furthermore,
only the principal strain $\varepsilon$, corresponding to the stress $\sigma$, need be
considered, since other strain components do not enter into expressions 
for the work done by the stress. The symbol M is used to represent the 
appropriate elastic modulus that relates stress and strain. 

As already mentioned, in order for internal friction to occur, the
oscillatory stress and strain must not be in phase with each other and,
in fact, in order that energy be dissipated, the strain must lag behind
the stress. Let us represent this angle of lag by $\phi$. Then $\phi$ is a
convenient measure of internal friction, since the energy dissipated in a
cycle of vibration goes to zero when $\phi$ approaches zero. It is convenient
to use complex notation to represent the various phase relationships.
Thus, the variation of stress with time is expressed as

$\sigma = \sigma_1 e^{i\omega t}$ .... (1)

where $\omega$ is the angular frequency of vibration ($\omega = 2\pi f$, where $f$ is the
number of vibrations per second) and $\sigma_1$ is the stress amplitude.
Correspondingly, the strain is

$\varepsilon = (\varepsilon_1 - i\varepsilon_2) e^{i\omega t}$ .... (2)

where the quantities $\varepsilon_1$ and $\varepsilon_2$ are the amplitudes of the components
of strain in phase with and 90° behind the stress, respectively. Clearly,

$\tan\phi = \varepsilon_2/\varepsilon_1$ .... (3)

Throughout this article it will be assumed that the internal friction is 
small, i.e. $\varepsilon_2 \ll \varepsilon_1$ and $\tan\phi \approx \phi$. This approximation greatly simplifies
the relations between various measures of internal friction, and there
have been very few experiments to which it does not apply. An
appropriate definition for the dynamic modulus, $M$, is

$M = \sigma_1/\varepsilon_1$ .... (4)

It is also convenient to define the complex modulus, $\mathcal{M}$, as the ratio of
stress to strain, using the complex notation of equations (1) and (2).
Thus,

$\mathcal{M} = \sigma_1/(\varepsilon_1 - i\varepsilon_2) \approx M(1 + i\tan\phi) \approx M(1 + i\phi)$ .... (5)

where the approximation of small $\phi$ has been introduced. The real part
of the complex modulus is $M$ and the ratio of the imaginary part to the
real part is the phase angle $\phi$, which is our present measure of internal
friction. The complex modulus, therefore, contains the essential para- 
meters pertaining to the vibration. 
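
As a numerical illustration of equations (3) to (5), the following Python sketch forms the complex modulus from a pair of strain amplitudes and compares the exact lag angle with the ratio of the imaginary to the real part. The function name and the amplitude values are hypothetical and serve only to show the size of the small-angle error.

```python
import cmath

def complex_modulus(sigma_1, eps_1, eps_2):
    """Ratio of complex stress to complex strain; the factor exp(i*omega*t) cancels."""
    return sigma_1 / complex(eps_1, -eps_2)

# Hypothetical amplitudes in arbitrary units (assumed, not taken from the text).
sigma_1 = 100.0
eps_1 = 1.0e-3   # strain component in phase with the stress
eps_2 = 1.0e-6   # strain component lagging the stress by 90 degrees

m_complex = complex_modulus(sigma_1, eps_1, eps_2)
M_dynamic = sigma_1 / eps_1                     # dynamic modulus, equation (4)
phi_exact = cmath.phase(m_complex)              # lag angle, atan(eps_2 / eps_1)
phi_ratio = m_complex.imag / m_complex.real     # imaginary part over real part

print(M_dynamic, m_complex.real, phi_exact, phi_ratio)
```

For damping of this size the two estimates of the phase angle differ only in the third order of the angle, and the real part of the complex modulus differs from the dynamic modulus only in the second order.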

The occurrence of non-elastic behaviour means more than the fact
that $\phi \neq 0$. In order to discuss the concept of non-elastic behaviour we
must consider the law that describes the interrelation of stress and
strain, not only under conditions of oscillation, but also under static
conditions, e.g. the application of a constant stress. A perfectly elastic
substance is one for which the strain is uniquely determined by the 
stress at any instant, i.e. a monotonic functional relationship exists 
between the two quantities. This function only deviates from Hooke's 
law of proportionality when the strain is large. Since we are concerned 
entirely with small deformations, a simple law of proportionality may 
always be used to represent the stress-strain equation for a perfectly 
elastic substance. A material that shows non-elastic behaviour is one for 
which the stress-strain equation is not a simple proportionality but 
assumes a generalized form, in which, for example, there may be terms 
involving time derivatives of stress and strain or terms that represent
multiple-valued functional relationships. For self-consistency with the
requirement of small damping, we may limit ourselves to stress-strain 
equations where all these non-elastic terms are very much smaller than 
the elastic terms. If the stress-strain equation is linear in stress, strain, 
and time derivatives of these quantities, it can be solved by the substi- 
tution of equations (1) and (2) to obtain the complex modulus directly. 
If, as a result of the non-elastic behaviour, terms which are nonlinear 
appear in the stress-strain equation, an exact solution can no longer be 
obtained by the substitution of a simple harmonic stress and strain, but 
requires complete Fourier series for these quantities, i.e. stress and 
strain may be periodic but not sinusoidal. An approximate solution 
can be obtained 13 if we ignore everything but the fundamental Fourier 
components of stress and strain and determine the complex modulus 
from these quantities. The parameters $M$ and $\phi$ obtained from the real
and imaginary parts of this complex modulus will depend on the ampli- 
tude of vibration, as a consequence of the nonlinear terms. We then 
speak of "amplitude dependent internal friction" and, in general, also 
of an "amplitude dependent dynamic modulus." This method, which 
converts a nonlinear problem into an approximately equivalent linear
problem with parameters dependent on the amplitude of vibration, is
known as the method of equivalent linearization. The approximation 
is a good one when the nonlinear terms are small, a condition which is
certainly valid in the present case, since all nonlinear terms are among 
the small non-elastic terms. 
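
The method of equivalent linearization can be sketched numerically as follows. The strain law used here, a hysteresis loop with parabolic branches added to a Hooke's law term, is purely an assumed toy model and is not taken from the text; the constants `M_prime`, `a` and `b` and all function names are likewise hypothetical. The sketch projects one cycle of the strain onto its fundamental Fourier components and forms the dynamic modulus and phase angle from them.

```python
import math

def fundamental_components(strain_of_phase, n=20000):
    """Project one cycle of strain onto cos (in phase) and sin (90 degree lag)."""
    eps_1 = 0.0
    eps_2 = 0.0
    for j in range(n):
        theta = 2.0 * math.pi * (j + 0.5) / n   # midpoint sampling of the cycle
        e = strain_of_phase(theta)
        eps_1 += e * math.cos(theta)
        eps_2 += e * math.sin(theta)
    return 2.0 * eps_1 / n, 2.0 * eps_2 / n

def toy_strain(sigma_1, M_prime=1.0e5, a=0.0, b=1.0e-9):
    """Assumed toy law: elastic strain plus a small loop with parabolic branches."""
    def strain(theta):
        sigma = sigma_1 * math.cos(theta)
        # Stress is decreasing for 0 < theta < pi; the strain then lies above the mean curve.
        branch = 1.0 if math.sin(theta) > 0.0 else -1.0
        non_elastic = (a + b * sigma_1) * sigma + branch * 0.5 * b * (sigma_1**2 - sigma**2)
        return sigma / M_prime + non_elastic
    return strain

def equivalent_linear(sigma_1):
    """Dynamic modulus and phase angle from the fundamental components only."""
    eps_1, eps_2 = fundamental_components(toy_strain(sigma_1))
    return sigma_1 / eps_1, eps_2 / eps_1

for amplitude in (10.0, 20.0):
    print(amplitude, equivalent_linear(amplitude))
```

For this assumed law both the phase angle and the dynamic modulus come out dependent on the stress amplitude, which is the behaviour the equivalent linear description is meant to capture.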

We will now consider two well-known theories of non-elastic behavi- 
our which lead to very simple stress-strain equations. 

The Voigt Solid 

In order to account for non-elastic behaviour of materials, VOIGT 14
made the assumption (previously introduced by STOKES 15 and others)
that, in addition to the elastic stress $M\varepsilon$ there is a frictional resistance
proportional to the rate of strain $\dot{\varepsilon}$. We, therefore, obtain the
stress-strain equation

$\sigma = M\varepsilon + \eta\dot{\varepsilon}$ .... (6)

This is a linear equation and, therefore, leads to an amplitude indepen- 
dent internal friction. A material that can be described in terms of 
equation (6) is called a Voigt solid. The complex modulus is 
$\mathcal{M} = M + i\omega\eta$, so that the internal friction is given by

$\phi = \omega\eta/M$ .... (7)

The internal friction of a Voigt solid is, therefore, proportional to the 
frequency of vibration and its dynamic modulus is a constant 
(independent of frequency). 

The Voigt assumptions are most often used in textbooks on theoretical 
physics, 16 to discuss the damped oscillations of a mass point attached 
to a weightless spring. The purpose of such treatments is usually to 
show how internal friction manifests itself in free and forced vibrations. 
No attention is usually paid to the question of the frequency dependence 
of internal friction. 

The Maxwell Solid 

An alternative description of non-elastic behaviour is due to MAXWELL. 17 
Rather than use the nebulous concept of an internal frictional force, 
this approach first assumes that the strain is made up of two parts:

$\varepsilon = \varepsilon' + \varepsilon''$ .... (8)

where $\varepsilon'$ is the perfectly elastic strain which obeys Hooke's law

$\varepsilon' = \sigma/M'$ .... (9)

and $\varepsilon''$ is the non-elastic strain which results from internal re-arrangement
and is not proportional to the stress. Equation (9) is to be regarded as
the definition of $M'$. MAXWELL further assumes that the non-elastic
term $\varepsilon''$ obeys the Newtonian viscosity equation, i.e. the rate of flow $\dot{\varepsilon}''$ is
proportional to the stress, or

$\dot{\varepsilon}'' = \zeta\sigma$ .... (10)

The stress-strain equation is obtained by combining equations (8), (9)
and (10):

$\dot{\varepsilon} = \dot{\sigma}/M' + \zeta\sigma$ .... (11)

From this equation and the assumption of small internal friction, we
obtain

$M = M'$ ; $\phi = \zeta M'/\omega$ .... (12)

Again the dynamic modulus is a constant, but now $\phi$ is inversely
proportional to the frequency. The amplitude independence of $\phi$ is
related to the linearity of equation (11).
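
The opposite frequency dependences of the two descriptions can be checked with a short Python sketch. It substitutes a sinusoidal stress and strain into the Voigt and Maxwell stress-strain equations and takes the ratio of imaginary to real part of the resulting complex modulus. The name `zeta` for the constant of proportionality in equation (10), the function names, and all numerical values are assumptions made for illustration.

```python
import math

def voigt_modulus(omega, M, eta):
    """Complex modulus of a Voigt solid, from sigma = M*eps + eta*d(eps)/dt."""
    return complex(M, omega * eta)

def maxwell_modulus(omega, M_prime, zeta):
    """Complex modulus of a Maxwell solid, from d(eps)/dt = (d(sigma)/dt)/M' + zeta*sigma."""
    compliance = 1.0 / M_prime + zeta / (1j * omega)   # strain divided by stress
    return 1.0 / compliance

def internal_friction(modulus):
    """Ratio of imaginary to real part, equal to the phase angle when it is small."""
    return modulus.imag / modulus.real

# Hypothetical constants in arbitrary consistent units.
M, eta = 1.0e5, 1.0e-2
M_prime, zeta = 1.0e5, 1.0e-9

for f in (1.0, 2.0, 4.0):
    omega = 2.0 * math.pi * f
    print(f,
          internal_friction(voigt_modulus(omega, M, eta)),
          internal_friction(maxwell_modulus(omega, M_prime, zeta)))
```

Doubling the frequency doubles the Voigt value and halves the Maxwell value, while the real part of the modulus stays at its elastic value to first order in both cases.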

It has been known for many years that the experimentally observed 
frequency dependence of internal friction, for most materials, is in 
agreement with neither the VOIGT nor the MAXWELL descriptions of 
non-elastic behaviour. Another shortcoming of these descriptions is the
prediction, in both cases, that the dynamic modulus of the system is
unaffected by the non-elastic behaviour, except for terms of second order
in the internal friction which are omitted here. Experimental observa- 
tions show that the occurrence of internal friction is often accompanied 
by a fractional decrease in the dynamic modulus, of the same order of 
magnitude as the internal friction. 

Various methods have been suggested for the generalization of the 
Voigt and Maxwell solids to obtain a more widely applicable theory. 
These methods regard the properties of an actual solid as a super- 
position of those of a large collection (finite or infinite) of Voigt or of 
Maxwell solids each with different parameters. In this way it is possible 
to introduce into the description of the non-elastic behaviour of a 
material as many arbitrary parameters as are necessary to describe its 
observed behaviour. Such superposition methods have been particularly 
popular in the study of mechanical properties of high polymers. 18,19
They are limited, however, to the treatment of materials whose internal 
friction is independent of the amplitude of vibration, and also, cannot 
be used for the study of static hysteresis (described in a later section). 
It will, therefore, be more suitable for us to introduce a different 
approach, which is sufficiently general to include all phenomena that
lead to internal friction in metals. 

The Generalized Maxwell Approach 

The first assumption of the Maxwell description is that a material 
under stress may undergo internal re-arrangement leading to a non-
elastic strain $\varepsilon''$ in addition to the elastic strain $\varepsilon'$. This assumption,
expressed by equations (8) and (9), is very generally applicable to non-
elastic phenomena. On the other hand, we will drop the highly restric-
tive second assumption expressed by equation (10) and, for the present,
leave unspecified the equation that describes the non-elastic strain $\varepsilon''$.
Without such an equation, we cannot obtain the stress-strain relation
required to complete the description of the behaviour of a material.
Possible equations for $\varepsilon''$ will be discussed in a later section; at present,
phase relationships will be discussed in general terms. 




Fig. 1. Phase relationships between the stress and the elastic 
and non-elastic strain 



If the stress (or its fundamental Fourier component) is represented
by equation (1), the elastic strain is

$\varepsilon' = \varepsilon_1' e^{i\omega t}$, with $\varepsilon_1' = \sigma_1/M'$ .... (13)

The non-elastic strain $\varepsilon''$ will, in general, be resolvable into a component
in phase with the stress and one lagging by 90°, or

$\varepsilon'' = (\varepsilon_1'' - i\varepsilon_2'') e^{i\omega t}$ .... (14)

where the quantities $\varepsilon_1'$, $\varepsilon_1''$ and $\varepsilon_2''$ are appropriate amplitudes and are
real and positive quantities. Fig. 1 shows the phase relationships
between stress and components of strain. This diagram is not drawn to
scale, since the assumption of small internal friction and non-elastic
effects means that $|\varepsilon''| \ll |\varepsilon'|$. With this approximation $\varepsilon_1 \approx \varepsilon_1'$ (to
first order), and equation (3) becomes

$\phi = \varepsilon_2''/\varepsilon_1'$ .... (15)

The internal friction is, to the first approximation, dependent only on
the component of $\varepsilon''$ out of phase with stress. The result is not surprising
since only the component of strain out of phase with the stress will
contribute to energy dissipation. The dynamic modulus $M$ is given by
(see equation (4))

$M = \sigma_1/(\varepsilon_1' + \varepsilon_1'') = M'/(1 + \varepsilon_1''/\varepsilon_1')$ .... (16)

where $M'$ is defined by equation (9). Thus, since the presence of non-
elastic behaviour may increase the in-phase component of strain, the
dynamic modulus is less than (or equal to) $M'$, and the difference
$\Delta M = M' - M$ is given by

$\Delta M/M = \varepsilon_1''/\varepsilon_1'$ .... (17)

This lowering of the elastic modulus because of the existence of a non-
elastic strain will be called the $\Delta M$ effect. This discussion has, therefore,
shown that internal friction and the $\Delta M$ effect are closely related con-
sequences of the occurrence of non-elastic strain, the first resulting from
the component out of phase with stress, and the second from the
component in phase with the stress. It is also clear that if the equation,
thus far unspecified, that describes $\varepsilon''$ is nonlinear, the ratios $\varepsilon_2''/\varepsilon_1'$ and
$\varepsilon_1''/\varepsilon_1'$ will be functions of strain amplitude. Specific forms for this
equation will be considered in the section called "Anelasticity and
Static Hysteresis."
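
A short worked example may make the two consequences of a non-elastic strain concrete. The Python sketch below takes assumed amplitudes of the in-phase and out-of-phase components of the non-elastic strain and evaluates the internal friction and the fractional lowering of the modulus. All names and numbers are hypothetical.

```python
def friction_and_modulus_defect(sigma_1, M_prime, eps1_non_elastic, eps2_non_elastic):
    """Internal friction and modulus lowering from the components of the non-elastic strain."""
    eps1_elastic = sigma_1 / M_prime                   # elastic amplitude, Hooke's law (9)
    phi = eps2_non_elastic / eps1_elastic              # out-of-phase part gives the damping
    M = sigma_1 / (eps1_elastic + eps1_non_elastic)    # dynamic modulus, equation (4)
    delta_M_over_M = (M_prime - M) / M                 # in-phase part lowers the modulus
    return phi, M, delta_M_over_M

# Hypothetical values in arbitrary consistent units.
phi, M, delta_M_over_M = friction_and_modulus_defect(
    sigma_1=100.0, M_prime=1.0e5, eps1_non_elastic=2.0e-6, eps2_non_elastic=1.0e-6)

print(phi, M, delta_M_over_M)
```

With these numbers the fractional lowering of the modulus equals the ratio of the in-phase non-elastic amplitude to the elastic amplitude, in agreement with equation (17), and is of the same order of magnitude as the internal friction.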

Measures of Internal Friction 

The relationship between various measures of internal friction, as well 
as of the AJf effect, will now be considered. As long as the discussion 
of non-elastic behaviour is restricted to the use of quantities such as the 
complex modulus and phase angle </>, it is possible to treat in the same 
way, either a small element of material or a complete specimen under 
homogeneous strain. If quantities such as the logarithmic decrement 
and the width of a resonance peak are to be considered, it is necessary to 
deal with an entire specimen in a particular mode of vibration. For 
simplicity, we will discuss here only specimens vibrating under homo- 
geneous strain, and in the next section will show how the present dis- 
cussion is generalized when the strain is not homogeneous. It is required, 
then, that a relatively large inertia member be attached to the specimen
under consideration. If $\theta$ represents the displacement of the inertia
member, $I$ its "inertia," $F_e$ and $F_a$ the "force" exerted upon this member
by the specimen and the externally applied "force," respectively, then
the equation of motion of this member is

$I\ddot{\theta} = F_e + F_a$ .... (18)

(In the example of the torsion of a hollow tube mentioned earlier, $\theta$ is
simply the angle of twist, $I$ the moment of inertia, and the quantities
$F$ are torques. On the other hand, if the vibration is under a longitu-
dinal load, $\theta$ becomes the longitudinal displacement, $I$ the attached mass,
and $F$ a force.) The quantity $\theta$ is essentially the strain, $\varepsilon$, throughout
the specimen, except for a geometric constant of proportionality,
while $F_e$ is proportional to the stress $\sigma$, but opposite in sign. Inasmuch
as $\sigma = \mathcal{M}\varepsilon$ (by the definition of $\mathcal{M}$, equation (5)), with $\mathcal{M} = M(1 + i\phi)$,
we may also write

$F_e = -\mathbf{k}\theta$ ; $\mathbf{k} = k(1 + i\phi)$ .... (19)

where $\mathbf{k}$ is a complex "force constant." Equation (18) may then be solved
in the case of forced vibrations with an impressed angular frequency $\omega$,
i.e. when the applied "force" is taken as sinusoidal of the form
$F_a = F_0 e^{i\omega t}$. Under steady state conditions, the equation will have a
solution of the form $\theta = \theta_0 e^{i\omega t}$, where $\theta_0$ is, in general, complex. (If the
stress-strain equation is nonlinear, $\theta$ will actually be representable by a
complete Fourier series, but as mentioned earlier, we will ignore all but
the fundamental Fourier components.) When this solution and
equation (19) are substituted into equation (18), an equation for $\theta_0$
is obtained:

$\theta_0 = (F_0/I)/(\omega_r^2 - \omega^2 + i\omega_r^2\phi)$ .... (20)

where

$\omega_r^2 = k/I$ .... (21)

The real quantity $|\theta_0|^2$ is given by

$|\theta_0|^2 = (F_0/I)^2/[(\omega_r^2 - \omega^2)^2 + \omega_r^4\phi^2]$ .... (22)

This is the well-known equation for a resonance curve (a plot of $|\theta_0|^2$
versus $\omega$) with a resonant angular frequency (at which the amplitude of
vibration is a maximum) equal to $\omega_r$. If the frequencies at which
$|\theta_0|^2$ falls to half the maximum value are denoted by $\omega_1$ and $\omega_2$ it
follows that

$(\omega_2 - \omega_1)/\omega_r = \phi$ .... (23)


The internal friction $\phi$ may, therefore, be obtained in forced vibration as
the width of the resonance curve (square of displacement
amplitude versus frequency) at half maximum divided by the resonant
frequency. Inasmuch as the reciprocal of this ratio is called the "Q"
of an electrical circuit, the symbol $Q^{-1}$ has been adopted by some
authors 1,20 as a measure of internal friction; from equation (23), $Q^{-1}$ is
equal to the phase angle between stress and strain. The study of
internal friction by means of a resonance curve is complicated when the
internal friction is amplitude dependent. When $\phi$ is itself a function of
$|\theta_0|$, and in fact (as commonly observed experimentally) an increasing
function of $|\theta_0|$, the height of the resonance peak will be determined
by the value of $\phi$ at maximum amplitude, but the peak is broader than
for the case of constant internal friction. Although the resonance peak
is broadened by the amplitude dependence of $\phi$, it may still be sym-
metrical about $\omega = \omega_r$ (to terms of first order in the internal friction) so
long as there is no $\Delta M$ effect. In general, however, amplitude depen-
dent internal friction is accompanied by an amplitude dependent
dynamic modulus $M$. This means that $k$ is a function of amplitude and,
therefore, that $\omega_r$ as well as $\phi$, in equation (22), is dependent on $|\theta_0|$.
In the usual case, where the $\Delta M$ effect increases (and, therefore, $\omega_r$
decreases) with increasing amplitude, it is readily shown from equation
(22) as well as experimentally 11 that the resonance curve falls off faster
on the low frequency side than on the high frequency side. If the ampli-
tude dependence of $\omega_r$ is sufficiently large, this asymmetry may become
so great that the resonance curve bends over on itself and becomes 
double valued on the low frequency side. Thus, discontinuities may be 
observed in tracing this curve experimentally. 21 
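
For the amplitude independent case, the relation between the width of the resonance curve and the phase angle can be verified numerically. The Python sketch below evaluates the resonance curve of equation (22) for a constant internal friction, locates the two half-maximum frequencies by bisection, and forms the ratio of the width to the resonant frequency. The resonant frequency, the value of the internal friction and the function names are assumed for illustration.

```python
import math

def amplitude_squared(omega, omega_r, phi, F0_over_I=1.0):
    """Square of the displacement amplitude for a constant internal friction phi."""
    return F0_over_I**2 / ((omega_r**2 - omega**2)**2 + (omega_r**2 * phi)**2)

def half_power_frequency(omega_r, phi, lo, hi, iterations=200):
    """Bisection for the frequency in [lo, hi] at which the curve is at half its peak."""
    half_peak = 0.5 * amplitude_squared(omega_r, omega_r, phi)

    def excess(omega):
        return amplitude_squared(omega, omega_r, phi) - half_peak

    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if (excess(lo) > 0.0) == (excess(mid) > 0.0):
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical specimen: 1000 vibrations per second, internal friction 0.001.
omega_r = 2.0 * math.pi * 1000.0
phi = 1.0e-3

omega_1 = half_power_frequency(omega_r, phi, 0.9 * omega_r, omega_r)
omega_2 = half_power_frequency(omega_r, phi, omega_r, 1.1 * omega_r)
q_inverse = (omega_2 - omega_1) / omega_r

print(q_inverse, phi)
```

The sketch holds the internal friction and the resonant frequency fixed; it does not attempt to reproduce the asymmetric or double valued curves that arise when these quantities depend on amplitude.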

When amplitude dependent effects are studied by forced vibration
methods, it is usually preferable to work at exactly the resonant
frequency where, from equation (22), the amplitude at resonance
$|\theta_m|$ is

$|\theta_m| = F_0/(I\omega_r^2\phi)$ .... (24)

It is necessary to determine the maximum amplitude $|\theta_m|$ and the
force $F_0$ in order to obtain $\phi$; both $\phi$ and $\omega_r$ are then known at the
specific amplitude $|\theta_m|$. By variation of $F_0$ it is then possible to
obtain these values at any other displacement amplitude. Whereas, in
principle this method is a very simple one for determination of the
internal friction and $\Delta M$ effect as a function of amplitude, the quantities
$F_0$ and $|\theta_m|$ are often difficult to obtain experimentally. In the piezo-
electric method (to be discussed in a subsequent section) these quantities
are readily calculated, so that this method has been particularly suitable
to the study of amplitude dependent internal friction. In other
methods of forced vibration, it is more convenient to keep $F_0$ constant
and determine $Q^{-1}$ (equation (23)), which requires only a knowledge of
relative amplitudes and does not require that $F_0$ be known. The latter
methods are, therefore, only useful when internal friction is independent 
of amplitude. 

In addition to the possibility for studying internal friction by the 
determination of the resonance curve in forced vibrations, the oldest 
and most popular methods are related to determination of the damping 
of free vibrations, where vibrations, once excited, are allowed to continue,
with no external forces applied to the system; the rate of decrease of
amplitude is then determined. In order to solve the equation of motion
(18) for $F_a = 0$ (no external forces), we substitute a solution of the form

$\theta = \theta_0 e^{i\boldsymbol{\omega}t}$ ; $\boldsymbol{\omega} = \omega(1 + i\delta/2\pi)$

which represents exponentially damped oscillations, if $\delta$ is a constant.
($\theta_0$ is now real.) When this solution is substituted into equation (18),
and cognizance taken of equation (19), we obtain: $I\boldsymbol{\omega}^2 = \mathbf{k}$; if real and
imaginary parts are now separately equated, and second order terms in
internal friction neglected, it is found that

$\omega^2 = k/I$ ; $\delta = \pi\phi$ .... (25)



The quantity $\delta$ represents the natural logarithm of the ratio of ampli-
tudes in two successive vibrations and is called the logarithmic decrement
(or simply, the "decrement"). When this quantity is small (as always
assumed in this discussion) it is also equal to the fractional decrease in
vibrational amplitude per cycle. In order to obtain $\delta$ from experiment,
it is simply necessary to plot the logarithm of the amplitude of vibration
against the number of cycles of vibration. If internal friction is
independent of amplitude a straight line is obtained whose slope gives
the decrement directly. However, if internal friction is amplitude 
dependent, this curve will not be a straight line, but its slope at any 
amplitude will give the decrement at that particular amplitude. Thus 
an entire curve of decrement versus amplitude of vibration may be 
obtained from a single decay curve if the precision of the data is 
sufficiently good. 
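
The graphical procedure just described amounts to fitting a straight line to the logarithm of the amplitude against the cycle number. A minimal Python sketch follows, using synthetic decay data generated with an assumed decrement; the function name and the data are hypothetical, and the fit applies as written only when the internal friction is independent of amplitude.

```python
import math

def decrement_from_decay(amplitudes):
    """Least-squares slope of ln(amplitude) against cycle number, with the sign reversed."""
    n = len(amplitudes)
    cycles = range(n)
    logs = [math.log(a) for a in amplitudes]
    cycle_mean = sum(cycles) / n
    log_mean = sum(logs) / n
    sxy = sum((c - cycle_mean) * (y - log_mean) for c, y in zip(cycles, logs))
    sxx = sum((c - cycle_mean) ** 2 for c in cycles)
    return -sxy / sxx

# Synthetic free decay with an assumed logarithmic decrement of 0.002 per cycle.
assumed_decrement = 0.002
amplitudes = [math.exp(-assumed_decrement * cycle) for cycle in range(200)]

fitted_decrement = decrement_from_decay(amplitudes)
phase_angle = fitted_decrement / math.pi   # decrement divided by pi, equation (25)

print(fitted_decrement, phase_angle)
```

For amplitude dependent internal friction the same fit could be applied to short stretches of the decay curve, each stretch giving the decrement at its own mean amplitude.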

Equations (21) and (25) also show that the natural frequency in free
vibration and the resonant frequency $\omega_r$ in forced vibration are the
same. This resonant frequency is affected by the existence of a $\Delta M$
effect; under conditions where non-elastic behaviour does not occur,
its value would be $\omega' = (k'/I)^{1/2}$, where $k'$ is the force constant corres-
ponding to the modulus $M'$. Because of the $\Delta M$ effect, there will be a
decrease in resonant frequency and, if we define $\Delta\omega = \omega' - \omega_r$, we
find that

$\Delta\omega/\omega_r = \tfrac{1}{2}\Delta M/M$ .... (26)



Thus, the existence of a $\Delta M$ effect is synonymous with a decrease in the
resonant frequency or in the frequency of free vibration. Also, as
already noted, amplitude dependent internal friction is usually accom-
panied by an amplitude dependent dynamic modulus. This effect is
detected by a change of resonant frequency, or of the frequency of free
vibration, with a change in the amplitude of vibration. 
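
In practice equation (26) is used in reverse, to infer the lowering of the modulus from a measured frequency shift. The Python sketch below compares the first-order estimate with the exact ratio of force constants for a pair of assumed frequencies; the function name and the frequency values are hypothetical.

```python
def modulus_defect_from_frequencies(f_elastic, f_resonant):
    """Fractional lowering of the modulus from the elastic and observed frequencies."""
    first_order = 2.0 * (f_elastic - f_resonant) / f_resonant   # equation (26) rearranged
    exact = (f_elastic / f_resonant) ** 2 - 1.0                 # k'/k - 1, since frequency^2 is proportional to k
    return first_order, exact

# Assumed frequencies in vibrations per second.
first_order, exact = modulus_defect_from_frequencies(f_elastic=1000.5, f_resonant=1000.0)

print(first_order, exact)
```

The two results differ only by a term of second order in the frequency shift, which is negligible under the assumption of small non-elastic effects.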

Another measure of internal friction is the fractional decrease in 
vibrational energy per cycle. Inasmuch as $\delta$ is the fractional decrease
in amplitude per cycle, and the vibrational energy is proportional to the
square of the amplitude, it follows that

$\delta = \Delta W/2W$

where $W$ is the vibrational energy and $\Delta W$ is the loss of vibrational
energy in one cycle. A closely related quantity which is widely used in
engineering applications is the damping capacity which is the per cent
decrease in vibrational energy per cycle, so that: damping capacity
= $200\delta$.

The relations between the various measures of internal friction are
summarized as follows:

$\phi = \delta/\pi = \Delta W/2\pi W = Q^{-1}$ .... (27)

The last equality applies only when internal friction is amplitude 
independent. These simple relationships between the various measures 
of internal friction are derived under the assumption of small damping 
and break down when this assumption is no longer valid. For large 
damping, the relations between different measures of internal friction 
depend on the mechanism of the damping and must be derived separately 
for each case. 22 
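
The small-damping relations of equation (27) can be collected in a few lines of code. The following is a minimal sketch, not part of the original treatment: the function name and dictionary keys are hypothetical, and the conversions hold only under the small-damping, amplitude independent assumptions stated above.

```python
import math


def internal_friction_measures(log_decrement):
    """Convert a logarithmic decrement (delta) into the other small-damping
    measures of internal friction. Valid only for small damping."""
    delta = log_decrement
    phi = delta / math.pi               # phase lag in radians; equals Q^-1
    energy_loss_fraction = 2.0 * delta  # Delta W / W, fractional energy loss per cycle
    return {
        "phi": phi,
        "Q_inverse": phi,
        "energy_loss_fraction": energy_loss_fraction,
        # damping capacity is the per cent energy loss per cycle, i.e. 200 * delta
        "damping_capacity_percent": 100.0 * energy_loss_fraction,
    }


# Illustrative value only: a logarithmic decrement of 0.01
measures = internal_friction_measures(0.01)
```

For a decrement of 0.01 the sketch gives a damping capacity of 2 per cent, in line with the relation damping capacity = 200 δ.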

The measured values of $\phi$ and $\Delta M/M$ will not only depend upon the
frequency and possibly the amplitude of vibration, but will also depend
on the type of stress employed. The relation between values obtained
in shear and in longitudinal vibration, for example, will depend on the
mechanism of the non-elastic behaviour.

PHENOMENOLOGICAL THEORY: NON-HOMOGENEOUS STRAIN* 

In practice, damping is rarely, if ever, studied under conditions where 
the mass is all concentrated in an auxiliary inertia member, and the 
stress and strain are homogeneous throughout the "elastic" member. 
Very often, the material whose elastic and non-elastic properties are to 
be studied, will be in the form of a bar which is vibrated in a longitudinal, 
transverse, or torsional mode of oscillation. Under these circumstances 
the stress amplitude is not constant throughout the specimen. It will 
be shown in this section, however, that the interpretation of vibration 
experiments on these more realistic specimens is completely analogous 
to the case of homogeneous strain, except that special attention is 
required when the internal friction is amplitude dependent. 

It is well known 23 that a perfectly elastic system will vibrate in
certain normal modes of vibration characteristic of the system. Once it
is vibrating in one normal mode it will continue to do so indefinitely.
If small non-elastic effects are introduced, it is to be expected that the
system will still vibrate in the same normal mode, except that now it
will diminish in amplitude due to energy losses. Such behaviour will
actually occur when the stress-strain equation is linear; in the case
where nonlinear terms occur, the solution of the equation of motion is
no longer a single normal mode of vibration. It is useful, however, to
consider only the components of the vibration in a single normal mode,
just as we considered only fundamental Fourier components in the case
of homogeneous strain when a nonlinear stress-strain equation occurred.
This extension of the method of equivalent linearization to vibration
under non-homogeneous strain has been justified experimentally 11 over
a range of amplitudes in which the internal friction reaches twice its
value at low amplitudes.

* The reader who is not interested in the details of this section may omit the section
without loss of continuity.

In most of the experimental methods a simple stress system is involved.
We may, therefore, continue to describe the vibration in terms
of a single component of stress $\sigma$ and the corresponding principal strain
$\varepsilon$. When the system vibrates in one particular normal mode, the strain
$\varepsilon$ may be written in the form

$\varepsilon = C(t)\,G(x, y, z)$ .... (28)

where the space dependent function $G$ is characteristic of the mode of
vibration (an eigenfunction of the vibration equation). By means of this
separation of variables, the problem of vibration in a normal mode is
reduced to a one-dimensional problem similar to that of homogeneous
strain, since only the co-ordinate $C$, which specifies the extent of
excitation, is needed in order that the strain at every point in the specimen
be known. A system vibrating very nearly in a normal mode, under
small damping, may, therefore, be represented by an equivalent simple
vibrator, whose equation of motion is of the form of equation (18), where
the quantity $C$ is now the displacement. For a perfectly elastic system,
$C = C_0 e^{i\omega t}$; on the other hand, if the specimen displays non-elastic
behaviour, the quantity $C$ will show damping in free vibration or a
resonance curve in forced vibration. It is necessary to relate the damping
$\phi_s$ of the equivalent vibrator, which represents the behaviour of the
specimen as a whole, and the dynamic modulus $M_s$, obtained from the
resonant frequency of the specimen, to the corresponding properties $\phi$
and $M$ of an element of specimen material. If the stress-strain equation
is linear and the quantities $M$ and $\phi$, therefore, independent of vibration
amplitude, we may anticipate that $\phi_s = \phi$ and $M_s = M$. If, on the
other hand, $M$ and $\phi$ are functions of the strain amplitude $\varepsilon_1$ within a
small element of the specimen, the quantities $\phi_s$ and $M_s$ will depend
upon the amplitude of vibration $C_1$ of the specimen.

To obtain the relations between the quantities that represent the
behaviour of the specimen as a whole and those that represent the
properties of the specimen material, it is convenient to employ the
notation of the generalized Maxwell approach introduced in the last
section. The quantity $C$ may be divided, in the usual way, into $C'$ and
$C''$, so that the parameters characteristic of the specimen are:

.... (29)



where $\Delta M_s = M' - M_s$ by analogy to the definition of $\Delta M$. Now,
consider an element of the specimen for which the total strain is $\varepsilon$ and
the non-elastic strain $\varepsilon''$. The internal friction

.... (30)

is, in general, a function of the strain amplitude $\varepsilon_1$. If the specimen is
assumed to be vibrating very nearly in a single normal mode, as in
equation (28), we have

$\varepsilon' = C'G; \quad \varepsilon'' = C''G$

If these last two expressions are inserted into equation (30), we obtain

.... (31)

This equality cannot be strictly true unless $\phi$ is a constant (since
dividing through by $G$ gives a space dependent function on the right and a
constant on the left, if $\phi \neq$ constant), i.e. as already stated, it is not
strictly correct to assume that only a single mode of vibration is
involved when $\phi$ is not constant. Inasmuch as any function of the
co-ordinates of the specimen can be expanded in terms of a set of
eigenfunctions, we may take components of the two sides of equation (31)
corresponding to the normal mode $G$. To do this, we multiply both sides
by $G$ and integrate over the volume of the specimen to obtain

.... (32)

where $d\tau$ represents an element of volume of the specimen. Equation
(32) relates the experimentally measured function $\phi_s(C_1)$ to the function
$\phi(\varepsilon_1)$, which is a property only of the specimen material. The equation
shows, as we have already anticipated, that only in the amplitude
independent case is the internal friction measured on a specimen vibrating
under non-homogeneous strain independent of the mode of vibration,
i.e. dependent only on the properties of the specimen material. In
exactly the same way, the result

.... (33)

is obtained.

In summary, it has been shown in this section that the only difference
in the analysis of the non-elastic behaviour of a specimen vibrating in a
normal mode and one for which the strain is homogeneous throughout is
that in the case of non-uniform strain the measured values of internal
friction and the dynamic modulus represent a weighted average (as
given in equations (32) and (33)) of the corresponding properties of the
specimen material, when these properties are amplitude dependent.
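
The weighted average described in this section can be sketched as follows. This is an illustration under stated assumptions rather than a restatement of the numbered equations: it assumes that, for homogeneous strain, the internal friction and modulus defect are defined by $\phi = \varepsilon_2''/\varepsilon_1'$ and $\Delta M/M = \varepsilon_1''/\varepsilon_1'$ (subscripts 1 and 2 denoting components in phase with and 90° behind the stress), that the specimen parameters are defined analogously from $C'$ and $C''$, and that the local strain amplitude is $\varepsilon_1 \approx C_1 G$.

```latex
\begin{align*}
\varepsilon_2'' &= \phi(\varepsilon_1)\,\varepsilon_1', \qquad
\varepsilon' = C'G,\quad \varepsilon'' = C''G
\;\Longrightarrow\;
C_2''\,G = \phi(C_1 G)\,C_1'\,G, \\[4pt]
\phi_s(C_1) \equiv \frac{C_2''}{C_1'}
 &= \frac{\int \phi(C_1 G)\,G^{2}\,d\tau}{\int G^{2}\,d\tau}, \qquad
\left(\frac{\Delta M}{M}\right)_{\!s}
 = \frac{\int \left(\frac{\Delta M}{M}\right)\!(C_1 G)\,G^{2}\,d\tau}{\int G^{2}\,d\tau}.
\end{align*}
```

The second line follows from multiplying the first by $G$ and integrating over the specimen volume. When $\phi$ does not depend on amplitude it comes outside the integral and $\phi_s = \phi$, whatever the mode shape $G$.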


VISCOELASTICITY AND STATIC HYSTERESIS

In the phenomenological treatment of internal friction given so far, 
MAXWELL'S first assumption (equations (8) and (9)) has been used, 
while his second assumption (equation (10)) has been eliminated on the 
grounds that it is too restrictive for a description of the non-elastic 
behaviour of most materials. It has been possible to introduce the $\Delta M$
effect and to show the relationship between the various measures of
internal friction without substituting any other assumptions in place 
of equation (10). In order to obtain any specific information, such as 
the variation of internal friction with frequency or amplitude of 
vibration, it is obviously necessary to introduce expressions which give 
further information about the non-elastic strain. 

The "Standard Linear Solid" 


A very important type of non-elastic behaviour is shown by materials
that have been called 5 "standard linear solids." Such materials may
also be said to show a "single time of relaxation." The basis for the
behaviour to be described is the existence of an internal parameter $p$
which takes on a different equilibrium value $\bar{p}$ at each value of the
stress and whose variation is accompanied by a non-elastic strain $\varepsilon''$.
The following three conditions are assumed to apply: (1) The non-elastic
strain is proportional to $p$,

$\varepsilon'' = \lambda p$ .... (34)

(2) The equilibrium value of $p$, denoted by $\bar{p}$, is proportional to the
stress. (3) The parameter $p$ approaches its equilibrium value $\bar{p}$, under
constant stress, according to the relaxation equation,

$\dot{p} = -(p - \bar{p})/\tau$ .... (35)

so that the difference $p - \bar{p}$ goes to zero exponentially, with relaxation
time $\tau$. From conditions (1) and (2), the equilibrium non-elastic strain
$\bar{\varepsilon}''$ ($= \lambda\bar{p}$) is proportional to stress and, therefore, to the elastic
strain $\varepsilon'$, or

$\bar{\varepsilon}'' = \Delta_M\,\varepsilon'$ .... (36)

where the dimensionless proportionality constant $\Delta_M$ is called the
relaxation strength and must be much less than unity if our assumption
of small damping is to be valid. When equations (34), (35) and (36) are
combined they give, as an equation for $\varepsilon''$,

$\dot{\varepsilon}'' = -(1/\tau)(\varepsilon'' - \Delta_M\,\varepsilon')$ .... (37)

This result replaces equation (10) in the Maxwell approach and is
essentially the same as that equation except for the presence of an
extra term on the right-hand side. When a harmonic solution is
substituted into equation (37), there results

$i\omega\,\varepsilon'' = -(1/\tau)(\varepsilon'' - \Delta_M\,\varepsilon')$ .... (38)




Fig. 2. Frequency dependence of internal friction and $\Delta M$ effect for a
standard linear solid. The functions are plotted for unit
relaxation strength, $\Delta_M$

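When equations (13) and (14) are substituted for $\varepsilon''$ and $\varepsilon'$ and the real
and imaginary parts of equation (38) separately equated, the internal
friction and $\Delta M$ effect are obtained directly:

$\phi = \Delta_M\,\dfrac{\omega\tau}{1 + \omega^2\tau^2}$ .... (39)

$\dfrac{\Delta M}{M} = \dfrac{\Delta_M}{1 + \omega^2\tau^2}$ .... (40)

$M = M'\left[1 - \dfrac{\Delta_M}{1 + \omega^2\tau^2}\right]$ .... (41)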



Because of the assumptions of linear relationships (equations (34) and
(36)) the internal friction and $\Delta M$ effect turn out to be independent of
vibration amplitude. The internal friction curve is a symmetrical peak
when $\phi$ is plotted against $\log(\omega\tau)$, as shown in Fig. 2, with a maximum
value equal to $\Delta_M/2$ at $\omega\tau = 1$. The interpretation for the occurrence
of such a peak is as follows: the internal parameter $p$ will constantly
strive to attain the appropriate equilibrium value $\bar{p}$ at every value of
stress, i.e. the relaxation process (equation (35)) is constantly going
on. When the period of vibration is very short compared to $\tau$, or
$\omega\tau \gg 1$, practically no relaxation is possible during a cycle because
the stress is changing rapidly compared to the time for achievement
of equilibrium. Under these conditions $\varepsilon''$ is practically equal to
zero, the behaviour of the material is essentially perfectly elastic, and,
therefore, $\Delta M = 0$, in agreement with equation (40) and Fig. 2.
The dynamic modulus $M$ that is measured at sufficiently high frequencies
is, therefore, equal to $M'$; it is also known 5 as the unrelaxed
modulus, inasmuch as it is measured under conditions where no relaxation
occurs. Fig. 3 (a) shows the proportionality between stress and
strain at high frequencies. The other extreme, $\omega\tau \ll 1$, corresponds
to a period of vibration so much longer than $\tau$ that there is sufficient
time for achievement of very nearly equilibrium values of the parameter
$p$ at every value of stress. Under these conditions the strain at every
value of stress is $\varepsilon' + \bar{\varepsilon}''$, which is a unique function of stress. As
illustrated in curve b of Fig. 3, the stress-strain relationship is again a
straight line and, therefore, $\phi$ is essentially zero; the dynamic modulus
$M$, which is the slope of this straight line, is lower than $M'$ and, in fact,
is equal to $M'(1 - \Delta_M)$ (from equation (41)). This value is known 5 as
the relaxed modulus, since it is measured under conditions where
relaxation is always essentially complete. Between these two extreme
cases, stress and strain will not be unique functions of one another,
since relaxation cannot keep up with alternations in stress. The
stress-strain curve is then an ellipse whose area attains a maximum value
when $\omega\tau = 1$. The slope of the major axis of the ellipse is the dynamic
modulus $M$ and always falls between the two extreme values of Fig. 3.
The variation of the quantity $\Delta M/M$ with frequency is shown in Fig. 2,
which is a plot of equation (40). It is interesting that only in the range
where damping is high is there an appreciable dependence of the dynamic
modulus on frequency. If measurements are made only at low
frequencies, there is no way of distinguishing the behaviour of the
specimen from that of a perfectly elastic substance whose modulus is
$M'(1 - \Delta_M)$.

Fig. 3. Stress-strain curves in a standard linear solid: (a) at very
high frequencies, (b) at very low frequencies. In both cases stress
and strain are in phase and there is no damping
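
The frequency dependence just described can be checked numerically. The sketch below assumes the usual single-relaxation forms $\phi = \Delta_M\,\omega\tau/(1+\omega^2\tau^2)$ and $\Delta M/M = \Delta_M/(1+\omega^2\tau^2)$ for equations (39) and (40); the function name and the relaxation strength of 0.01 are illustrative choices, not values from the text.

```python
import numpy as np


def standard_linear_solid(omega_tau, relaxation_strength):
    """Internal friction and modulus defect of a standard linear solid
    as functions of the product omega * tau (small-damping forms)."""
    x = np.asarray(omega_tau, dtype=float)
    phi = relaxation_strength * x / (1.0 + x**2)
    modulus_defect = relaxation_strength / (1.0 + x**2)
    return phi, modulus_defect


# Scan six decades of omega * tau, centred on omega * tau = 1
omega_tau = np.logspace(-3, 3, 601)
phi, defect = standard_linear_solid(omega_tau, relaxation_strength=0.01)
peak_index = int(np.argmax(phi))

print("peak at omega*tau =", omega_tau[peak_index])
print("peak height       =", phi[peak_index])   # half the relaxation strength
print("low-frequency defect  =", defect[0])     # approaches the relaxation strength
print("high-frequency defect =", defect[-1])    # approaches zero
```

The peak falls at $\omega\tau = 1$ with height $\Delta_M/2$, while the modulus defect changes appreciably only in the neighbourhood of the peak.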

The standard linear solid subjected to a given type of vibration is
completely described by three independent constants, $M'$, $\Delta_M$ and $\tau$,
in contrast to the one constant, the elastic modulus, of a perfectly
elastic body. The appropriate modulus depends on the type of experiment
and in most cases will be either Young's modulus $E$ or the rigidity
modulus $G$. The relaxation strength $\Delta_M$ is a measure of the magnitude
of the non-elastic effects, i.e. the height of the internal friction peak and
the value of $\Delta M/M$ at very low frequencies. It is a quantity that should
be small compared to unity in order that the assumptions made in
deriving the relationship between different measures of internal friction
be valid. Actual observed values of $\Delta_M$ have ranged from $10^{-4}$ to
greater than $10^{-1}$. The latter value may seem to violate our requirement
of $\Delta_M \ll 1$, but the error in the application of the above theory to the case
of $\Delta_M \approx 0.1$ is not serious. The reason for designating the appropriate
modulus as a subscript to $\Delta$ is that the relaxation strength is dependent
upon the type of vibration experiment. For example, in many cases
relaxation is produced only by shear stress. Thus, if a vibration experiment
were conducted under hydrostatic stress only, we would have
$\Delta_K = 0$ ($K$ = bulk modulus). In a torsion experiment, $\Delta_G$ is involved
and in tension or flexure $\Delta_E$. However, since in tension only the shear
components would produce relaxation, the relation between $\Delta_G$ and $\Delta_E$
is readily obtained 5 from elasticity theory as

$\Delta_G = (3G/E)\,\Delta_E$ .... (42)

For the usual values of $G/E$, $\Delta_G$ is about 15 per cent greater than $\Delta_E$.
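
The figure of about 15 per cent can be illustrated with a short calculation. The sketch assumes the relation $\Delta_G = (3G/E)\,\Delta_E$ together with the isotropic-elasticity identity $G/E = 1/[2(1+\nu)]$; the Poisson's ratio of 0.30 is an illustrative value typical of many metals, not one quoted in the text.

```python
def shear_to_tensile_strength_ratio(poisson_ratio):
    """Ratio Delta_G / Delta_E = 3G/E for an isotropic solid,
    using G/E = 1 / (2 * (1 + nu))."""
    g_over_e = 1.0 / (2.0 * (1.0 + poisson_ratio))
    return 3.0 * g_over_e


# Illustrative Poisson's ratio (assumption, not from the text)
ratio = shear_to_tensile_strength_ratio(0.30)
print(f"Delta_G / Delta_E = {ratio:.4f}")
```

With $\nu = 0.30$ the ratio is about 1.15, so the shear relaxation strength exceeds the tensile one by roughly 15 per cent.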
The third independent constant of the standard linear solid is the
relaxation time $\tau$. This parameter is essentially the mean time for the
appropriate internal re-arrangements to occur which will lead to the
attainment of the equilibrium value of the internal parameter $p$. In
the important case where a change in $p$ occurs by atomic re-arrangement
or diffusion, we have at our disposal a valuable method for obtaining the
mean atomic jump time. This application of internal friction measurements
will be discussed later. Whenever atomic movements of any kind
are involved in the relaxation process it is to be expected that $\tau$ will be
strongly temperature dependent, according to the Arrhenius equation,

$\tau = \tau_0\,e^{H/RT}$ .... (43)

where $H$ is a heat of activation, $R$ is the gas constant, and $T$ the absolute
temperature. The existence of such a relationship can be used to great
advantage when it is desired to obtain the internal friction peak
experimentally. If this peak is to be obtained by varying the frequency
it requires measurements over two or three powers of ten in $\omega$ (see Fig. 2).
Such measurements usually are experimentally difficult. Inasmuch as
$\phi$ is dependent only on the product $\omega\tau$, the same end can be achieved if
$\phi$ is observed as a function of $\tau$ (by varying the temperature) at constant
$\omega$ as by a study of $\phi$ as a function of $\omega$ at constant $\tau$. A similar curve to
that of Fig. 2 may, therefore, be obtained by plotting $\phi$ against $T^{-1}$
when $\tau$ obeys equation (43). This curve will still have a maximum when
$\omega\tau = 1$, except that $\tau$ is now the variable. Curves of $\phi$ against $T^{-1}$
obtained at two different frequencies $\omega_1$ and $\omega_2$ can be superposed by a
horizontal shift, and from the magnitude of this shift the heat of activation
$H$ can be calculated as follows: for the same value of $\phi$,
$\omega_1\tau_1 = \omega_2\tau_2$, where the subscripts 1 and 2 refer to the two curves.
Therefore

$\omega_2/\omega_1 = \tau_1/\tau_2 = \exp\left[(H/R)(1/T_1 - 1/T_2)\right]$

or

$\ln(\omega_2/\omega_1) = (H/R)\,\Delta(1/T)$ .... (44)

so that from the experimentally observed shift, $\Delta(1/T)$, between the
curves for frequencies $\omega_1$ and $\omega_2$, the quantity $H$ is determined.
Equation (44) is more generally valid than the case of a standard linear
solid, since its derivation is dependent only on the fact that $\phi$ is a
function of $\omega$ and $\tau$ through the product $\omega\tau$, and not on the form of
that function.
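
The use of the peak shift to obtain the heat of activation can be sketched numerically. The example assumes the Arrhenius form $\tau = \tau_0 e^{H/RT}$ and the relation $\ln(\omega_2/\omega_1) = (H/R)\,\Delta(1/T)$ with $\Delta(1/T) = 1/T_1 - 1/T_2$; the function names, the value of $\tau_0$, the frequencies and the "true" activation energy are all hypothetical inputs used to generate synthetic peak temperatures.

```python
import math

GAS_CONSTANT = 8.314  # J / (mol K)


def peak_temperature(frequency_hz, activation_energy, tau_0):
    """Temperature at which omega * tau = 1 for an Arrhenius relaxation time."""
    omega = 2.0 * math.pi * frequency_hz
    return activation_energy / (GAS_CONSTANT * math.log(1.0 / (omega * tau_0)))


def activation_energy_from_shift(freq_1, freq_2, temp_1, temp_2):
    """Heat of activation from the shift of the phi-versus-1/T curve
    between two frequencies; temp_i is where a given phi occurs at freq_i."""
    shift = 1.0 / temp_1 - 1.0 / temp_2
    return GAS_CONSTANT * math.log(freq_2 / freq_1) / shift


# Synthetic data (hypothetical values): H = 80 kJ/mol, tau_0 = 1e-14 s
H_true, tau_0 = 80.0e3, 1.0e-14
T1 = peak_temperature(1.0, H_true, tau_0)    # peak temperature at 1 Hz
T2 = peak_temperature(10.0, H_true, tau_0)   # peak temperature at 10 Hz
H_estimated = activation_energy_from_shift(1.0, 10.0, T1, T2)
print(f"T1 = {T1:.1f} K, T2 = {T2:.1f} K, H = {H_estimated:.1f} J/mol")
```

Only the frequency ratio enters, so the result is the same whether cyclic or angular frequencies are used, and the pre-exponential factor $\tau_0$ drops out of the estimate.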

Superposition of Relaxation Times 

Although there are several physical mechanisms in metals and other
solids that lead to non-elastic behaviour describable by equation (37), a
material whose internal friction curve is given precisely by equation (39),
over the entire frequency range, has never been found. On the one hand
one may find more than one such peak, resulting from different mechanisms
and located at different positions on the frequency scale. Thus,
an acoustic or relaxation spectrum 5 is said to exist, somewhat in analogy
to the optical spectrum. On the other hand, any single peak of the
relaxation spectrum may deviate to a lesser or greater extent from the
theoretical curve of equation (39). The reason for this deviation is
related to fluctuations in the time of relaxation at different positions in
the specimen. For example, if the relaxation process is controlled by
atomic diffusion, the time of relaxation will be closely related to the
mean atomic jump time. This latter quantity is capable of fluctuating
in accordance with fluctuations in the number of impurity atoms and in
the state of internal stress in the immediate neighbourhood of the atoms
under consideration. Usually, therefore, the term $\varepsilon''$ will be made up of
a series of terms each obeying equation (37) but with slightly different
values of $\tau$. The observed internal friction and $\Delta M$ effect may then be
a superposition of terms of the forms given by equations (39) and (40), with
different values of $\tau$. The existence of a range of relaxation times, no
matter how narrow, in place of a single relaxation time, will always result
in a broadening of the internal friction peak. Equation (39), therefore,
serves as a theoretical limit which is very closely approached in some
cases (e.g. interstitial diffusion in b.c.c. lattices) while in other cases
(such as grain-boundary internal friction) considerably wider peaks are
observed. These examples will be discussed more fully in subsequent
sections.
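
The broadening produced by a range of relaxation times can be illustrated numerically. The sketch below assumes that each component has the single-relaxation form $\phi = \Delta\,\omega\tau/(1+\omega^2\tau^2)$ and, purely for illustration, that the relaxation times are distributed with Gaussian weights in $\ln\tau$; the distribution, its width and the function name are hypothetical choices, not taken from the text.

```python
import numpy as np


def broadened_peak(omega, mean_tau, log_width, relaxation_strength, n_terms=201):
    """Internal friction from a superposition of single-relaxation terms.
    Weights are Gaussian in ln(tau) (an illustrative assumption) and are
    normalized so the total relaxation strength is unchanged."""
    z = np.linspace(-4.0, 4.0, n_terms)
    weights = np.exp(-0.5 * z**2)
    weights /= weights.sum()
    taus = mean_tau * np.exp(log_width * z)
    x = np.outer(np.asarray(omega, dtype=float), taus)  # omega * tau for each pair
    return relaxation_strength * (weights * x / (1.0 + x**2)).sum(axis=1)


omega = np.logspace(-4, 4, 801)
single = broadened_peak(omega, mean_tau=1.0, log_width=0.0, relaxation_strength=0.01)
broad = broadened_peak(omega, mean_tau=1.0, log_width=2.0, relaxation_strength=0.01)

# Peak height, and number of grid points above half maximum (a width measure)
print("single:", single.max(), int(np.sum(single > 0.5 * single.max())))
print("broad: ", broad.max(), int(np.sum(broad > 0.5 * broad.max())))
```

With zero width the single-relaxation peak of height $\Delta/2$ is recovered; with a finite spread of $\ln\tau$ the peak is lower and wider, for the same total relaxation strength.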

When the observed internal friction is the result of the superposition
of mechanisms which have a range of relaxation times, but for which each
value of $\tau$ obeys equation (43), there are two possibilities that must be
considered: (1) the various values of $\tau$ differ from each other only in
the values of $\tau_0$; (2) different values of $H$ are involved. The first case
(probably illustrated by grain-boundary internal friction) occurs when
the various values of $\tau$ are each different multiples of a unit jump time
$\tau_1$, which itself obeys an Arrhenius equation. Such a situation might
arise if, in different parts of a material, relaxation is produced by the
occurrence of a different number of atomic jumps, each jump requiring
a mean time $\tau_1$. In this case, the internal friction curve, no matter what
its form, is still dependent on frequency and temperature through the
product $\omega e^{H/RT}$. The arguments leading to equation (44), therefore, still
apply and the value of $H$ may be obtained from the horizontal shift of
the curve of $\phi$ versus $T^{-1}$ with a change of frequency. In the second
case, in which $H$ is not constant, a change of frequency produces a change
in the shape of the internal friction curve, since its various components
are shifted by different amounts. If the change of shape is not too great,
an average value of $H$ may be obtained from the average horizontal
shift.

Anelasticity 

The material that shows a single time of relaxation and one which is
describable in terms of a superposition of relaxation times are the most
important examples of that field of non-elastic behaviour that ZENER 4, 5
has called "anelasticity." In order that an imperfectly elastic solid be
said to show anelasticity two conditions are required: (1) the stress-strain
equation should be linear; (2) if a static stress is applied to such a
material and then removed, no permanent set shall remain, i.e. any
residual strain that exists after the stress is removed shall decay back
to zero if sufficient time is allowed. As an example, in the case of the
standard linear solid, if equations (8) and (9) are combined with (37) it is
possible to eliminate $\varepsilon'$ and $\varepsilon''$ to obtain a linear equation in $\varepsilon$,
$\dot{\varepsilon}$, $\sigma$, and $\dot{\sigma}$ (the stress-strain equation). (This equation may be
generalized 24 by including higher derivatives of $\varepsilon$ and $\sigma$.) Anelasticity
excludes amplitude dependent effects, for as long as condition (1) applies,
the internal friction and $\Delta M$ effect will be independent of amplitude. As for
condition (2) in the case of the standard linear solid, it follows from the
defining assumptions that there must be only one equilibrium value of
strain for every value of stress. It is readily shown that the Maxwell
solid does not obey the second condition for anelasticity, although it
clearly satisfies the first condition. (If the stress is brought to zero after
non-elastic deformation has occurred, equation (10) states that $\dot{\varepsilon}'' = 0$,
i.e. there is no recovery of the non-elastic strain in the Maxwell solid.)
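
The elimination of $\varepsilon'$ and $\varepsilon''$ mentioned above can be sketched explicitly. The following assumes that equations (8) and (9) have the forms $\varepsilon = \varepsilon' + \varepsilon''$ and $\sigma = M'\varepsilon'$, and that the relaxation equation is $\dot{\varepsilon}'' = -(1/\tau)(\varepsilon'' - \Delta_M\varepsilon')$; these forms are assumptions made for the illustration and should be compared with the numbered equations themselves.

```latex
\begin{align*}
\varepsilon'' &= \varepsilon - \sigma/M', \qquad \varepsilon' = \sigma/M', \\
\dot{\varepsilon} - \dot{\sigma}/M'
  &= -\frac{1}{\tau}\left[\varepsilon - (1 + \Delta_M)\,\sigma/M'\right], \\
(1 + \Delta_M)\,\sigma + \tau\,\dot{\sigma}
  &= M'\left(\varepsilon + \tau\,\dot{\varepsilon}\right).
\end{align*}
```

The result is linear in $\varepsilon$, $\dot{\varepsilon}$, $\sigma$ and $\dot{\sigma}$, as condition (1) requires. Under static conditions it reduces to $\sigma = M'\varepsilon/(1+\Delta_M) \approx M'(1-\Delta_M)\,\varepsilon$, so that each stress has a single equilibrium strain and zero stress corresponds to zero strain, in keeping with condition (2).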

Coupled Relaxations 

It is possible that if several sources of relaxation with different values 
of r are operating simultaneously, the internal friction may not be a 
simple superposition of the values that would be found if each were 
acting separately. If the relaxation of one source affects the conditions
at the second, coupling may be said to occur and the resultant relaxa- 
tion may be considerably greater than that obtained by superposition. 
That such coupling of relaxations can occur has been noted recently 25 in 
connexion with an irregular arrangement of interfaces in a metal. The 
phenomenon of coupled relaxations, which also falls into the category 
of anelasticity, will be further discussed in a subsequent section, in 
relation to the mechanisms that produce it. 

General Viscoelastic Behaviour 

Although most studies of the mechanism of internal friction in metals 
have thus far centred about anelastic effects, the two conditions for 
anelasticity have been too restrictive to be as widely applicable in the 
study of high polymers. Workers in the latter field have found 20 that 
in many cases, it is not possible to give adequate description to the 
mechanical properties of polymeric substances in terms of linear 
equations. They have also found 27 that equations which call for viscous 
flow and which, therefore, permit the occurrence of a permanent set, 
are also necessary to interpret the experimental behaviour of these 
materials. Any material which can be described in terms of a differential 
equation in stress, strain, and the time derivatives of stress and strain, 
whether or not the equation is linear or implies the possibility of a 
permanent set, is called viscoelastic. In metal physics the occurrence 
of viscoelastic behaviour outside of anelasticity has not been widely
reported, probably because extensive enough investigations have not
been made and also because most investigations have been conducted at
low temperatures where permanent set is less likely to occur. In one
investigation 28, 29 of internal friction at relatively high temperatures, it
has been established that the mechanism of damping is one that
corresponds to a viscous-type flow under static stress.

On the basis of present experimental evidence it seems as if any
viscoelastic behaviour in metals (even the type that produces a permanent
set) does not result in strongly amplitude dependent internal
friction at the low strain amplitudes at which most experiments are
conducted. The maximum strain amplitude ordinarily employed is of
the order $10^{-5}$ or lower. The reason for this independence of amplitude
below strains of $10^{-5}$ is probably that even where a nonlinear equation
is involved, it can usually be approximated adequately by linear
expressions at these low amplitudes. Amplitude dependent internal
friction plays a more important role in the case of static hysteresis,
which will now be discussed.

Static Hysteresis 

In this section, it will be shown that a material that manifests no
viscoelastic behaviour may still behave non-elastically and produce
internal friction and a $\Delta M$ effect, as a result of a mechanism entirely
different from viscoelasticity.

Internal friction can only occur when stress and strain are not
unique functions of one another in vibration, that is, when the dynamic
stress-strain curve is a hysteresis loop. In viscoelastic materials,
hysteresis arises as a consequence of the appearance of time derivatives
in the stress-strain equation coupled with the fact that the experiment
is conducted under oscillatory stresses. The occurrence of the hysteresis
loop is a consequence of the dynamic nature of the experiment and the
area of the loop is very much dependent on frequency. In the case of an
anelastic material, for example, if an experiment is conducted statically
with stress applied and released sufficiently slowly to keep the material
constantly in equilibrium, the area of the hysteresis loop will go to
zero; stress and strain will be uniquely related. We have already seen
such behaviour in curve b of Fig. 3. Hysteresis may also arise, in
materials that are not viscoelastic, in the following way: suppose that
as the stress is increased there is a definite value of strain for each stress,
this strain being attained instantaneously and, therefore, independently
of the rate of loading. Now, when a particular maximum strain is
attained, the stress is released. Again it is assumed that at each value of
the decreasing stress the strain instantaneously adjusts itself to an
appropriate value, but the strain, for a given stress during unloading, is
not the same as the corresponding value during loading. A permanent
residual deformation remains after removal of the stress which can only
be brought back to zero strain by the application of a reversed load.
This type of hysteresis loop, illustrated in Fig. 4, is the same whether it
is traversed statically or dynamically; it is entirely independent of the
frequency of vibration. The stress-strain equation, therefore, does not
involve time derivatives, as in the case of viscoelasticity, but is, simply,
a multivalued function. This phenomenon is well known in the case of
large-scale plastic deformation at low temperatures where the deformation
is practically instantaneous, so that the amount of deformation is
almost independent of the rate of loading. It was with regard to such
hysteresis, obtainable in large-scale plastic flow, that BURGERS 30
suggested the term static hysteresis in contrast to the dynamic hysteresis
displayed by viscoelastic materials. The fact that static hysteresis
occurs for stresses greater than those required to start plastic flow may
be important in fatigue studies, but certainly not in internal friction at
very low amplitudes. It has been known for some time, however, that
ferromagnetic substances, because of the phenomenon of magnetostriction,
will follow a mechanical hysteresis loop that is independent of
frequency and which gives rise to internal friction at low amplitudes.
Recently it has been demonstrated 11, 12 that damping related to the
motion of dislocations in a single crystal originates from a static hysteresis
loop at strain amplitudes as low as $10^{-7}$.* It, therefore, appears that
static hysteresis may occur at exceedingly low amplitudes, and that it
may result from atomic as well as magnetic re-arrangements. Obviously,
such re-arrangements do not actually occur instantaneously, but are
probably propagated with the velocity of sound. Such velocities are
sufficiently "instantaneous" at frequencies normally used in vibration
experiments.

* The writer in his original description of this behaviour called it "simple hysteresis."

Fig. 4. Schematic illustration of a static hysteresis loop (horizontal
axis: strain). The same loop is obtained regardless of the rate of traversal

In contrast to viscoelastic internal friction, that originating in static
hysteresis seems to show a marked amplitude dependence even at very
low amplitudes. Such observations are not at all surprising because the
static hysteresis loop will, in general, deviate strikingly from a linear
relationship (as, for example, Fig. 4).

In order to calculate $\phi$ and $\Delta M/M$ for a static hysteresis loop, it is
convenient to consider only the loop of non-elastic strain $\varepsilon''$ versus
stress. Let us represent the slope of these curves, $d\varepsilon''/d\sigma$, by*

$d\varepsilon''/d\sigma = F(\sigma)$ (loading)
$\phantom{d\varepsilon''/d\sigma} = f(\sigma)$ (unloading) .... (45)

If $\sigma = \sigma_1 \cos\omega t$, where $\sigma_1$ is the amplitude, $\varepsilon''$ is periodic but in general
not simple harmonic. It is desired, in the usual way, to take the fundamental
Fourier components of $\varepsilon''$ in phase with and 90° behind the
stress. These components, $\varepsilon_1''$ and $\varepsilon_2''$ respectively, may be obtained by
Fourier analysis (after partial integration to obtain integrals of $d\varepsilon''/d\sigma$
rather than of $\varepsilon''$) and inserted into the expressions (15) and (17) to give

$\phi = \dfrac{2M'}{\pi}\displaystyle\int_0^{\pi/2} \left[F(\sigma_1\cos\theta) - f(\sigma_1\cos\theta)\right]\cos\theta\,\sin\theta\,d\theta$ .... (46)

$\dfrac{\Delta M}{M} = \dfrac{2M'}{\pi}\displaystyle\int_0^{\pi/2} \left[F(\sigma_1\cos\theta) + f(\sigma_1\cos\theta)\right]\sin^2\theta\,d\theta$ .... (47)

In the derivation of equations (46) and (47) it is assumed that $F$ and $f$
are even functions, i.e. the two halves of the hysteresis loop are identical.
Such an assumption will generally be valid under steady-state conditions.
In the case of the parallelogram loop, where $F$ and $f$ are constants,
the quantities $\phi$ and $\Delta M/M$ are independent of the stress amplitude
$\sigma_1$. In the more general case, it may be convenient to use power series
expansions for the functions $F$ and $f$:

$F(\sigma) = A_0 + A_1|\sigma| + A_2\sigma^2 + \ldots$
$f(\sigma) = a_0 + a_1|\sigma| + a_2\sigma^2 + \ldots$ .... (48)

The absolute value \a\ appears in the linear term because F and / 
are even functions. When these expressions are substituted into 
equations (46) and (47) and the appropriate integrals evaluated, the 
amplitude dependence of ^ and &M/M may be obtained as power series 
in the strain amplitude 

M' 

<f> = [( A, - a ) + f M (A l - a^ + 



.... (49) 

where we have substituted Mε₁ = σ₁. In general, if there is no reason
to expect that the stress-strain curve is a parallelogram loop, the
occurrence of static hysteresis is closely associated with amplitude
dependent effects even at very low amplitude. This characteristic
amplitude dependence together with the frequency independence of
internal friction resulting from static hysteresis make it possible to
distinguish this type of behaviour from viscoelasticity.

* It is assumed, in this treatment, that the functions F and f are independent of
the stress amplitude. This assumption is a restrictive one, and the question of its
validity can only be determined when the mechanism of the non-elastic behaviour is
known.
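The amplitude dependence described above can be checked numerically. The following is a minimal sketch, not part of the original treatment: it evaluates the integrals (46) and (47) by Simpson's rule for quadratic F and f, and compares the result with the power series obtained by integrating term by term. The coefficient values, the choice M′ = 1 and the function names are hypothetical and dimensionless, chosen only for illustration.

```python
import math


def simpson(g, a, b, n=2000):
    """Composite Simpson's rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def loop_integrals(F, f, sigma1, m_prime):
    """Evaluate the integrals of equations (46) and (47) numerically."""
    def s(theta):
        return sigma1 * math.cos(theta)

    phi = (2 * m_prime / math.pi) * simpson(
        lambda t: (F(s(t)) - f(s(t))) * math.cos(t) * math.sin(t),
        0.0, math.pi / 2)
    dm_over_m = (2 * m_prime / math.pi) * simpson(
        lambda t: (F(s(t)) + f(s(t))) * math.sin(t) ** 2,
        0.0, math.pi / 2)
    return phi, dm_over_m


def series(A, a, sigma1, m_prime):
    """Term-by-term integration of the quadratic expansions of F and f."""
    phi = (m_prime / math.pi) * (
        (A[0] - a[0])
        + (2 / 3) * (A[1] - a[1]) * sigma1
        + 0.5 * (A[2] - a[2]) * sigma1 ** 2)
    dm_over_m = (m_prime / 2) * (
        (A[0] + a[0])
        + (4 / (3 * math.pi)) * (A[1] + a[1]) * sigma1
        + 0.25 * (A[2] + a[2]) * sigma1 ** 2)
    return phi, dm_over_m


def poly(c):
    """Even function c0 + c1*|s| + c2*s**2."""
    return lambda s: c[0] + c[1] * abs(s) + c[2] * s * s


# Hypothetical, dimensionless coefficients chosen only for illustration.
A = (0.020, 0.010, 0.004)
a = (0.010, 0.004, 0.001)

numeric = loop_integrals(poly(A), poly(a), sigma1=0.5, m_prime=1.0)
closed = series(A, a, sigma1=0.5, m_prime=1.0)
print("numerical integration:", numeric)
print("power series:         ", closed)
```

With constant F and f (the parallelogram loop) the same routine returns values that do not change with `sigma1`, which is the amplitude independence noted in the text.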

Summary of Phenomenological Theory 

The various mechanisms that produce damping may be classified into 
the following categories of non-elastic behaviour : 

I. Viscoelasticity (dynamic hysteresis)
A. Anelasticity 

1. Single time of relaxation (standard linear solid). 

2. Superposition of relaxations. 

3. Coupled relaxations. 

B. Processes involving viscous flow.

II. Static hysteresis. 

More complex behaviour, which may be represented as a combination
of viscoelasticity and static hysteresis, is also possible but has not been 
investigated as yet. 

METHODS OF MEASUREMENT 

The methods most commonly used in the measurement of internal 
friction will be classified and the general principles discussed briefly 
in this section. The specific variations of each general method have 
been too numerous to make a detailed discussion of them possible here. 

The Torsion Pendulum 

This general type of method is essentially distinct from the others to be 
discussed, in that it is a low frequency method. The use of an auxiliary 
inertia member of relatively large moment of inertia compared to that 
of the specimen to which it is attached, usually results in a low torsional 
frequency of vibration, in the range 0.1 to 15 vibrations per second.
The damping is determined by observation of the rate of free decay, 
i.e. the logarithmic decrement (equation (25)), usually by means of an 
optical method, where a beam of light is reflected by a mirror mounted 
on the inertia member and viewed on a translucent scale at an appro- 
priate distance from the mirror. The initial excitation of vibrations is 
usually accomplished by making part of the inertia member out of a 
ferromagnetic material and suitably placing an electromagnet so as to 
attract this part when a switch is closed to activate the magnet.
If the frequency of vibration is less than about 2 sec⁻¹ it is possible
to obtain the amplitude as a function of number of vibrations either
visually or by automatic (usually photographic) recording; at higher
frequencies the automatic method must be used.
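Once the amplitude is known as a function of the number of vibrations, the logarithmic decrement follows from the slope of the logarithm of the amplitude. The sketch below illustrates this with a least-squares fit on synthetic readings. The data are invented, and the conversion φ = δ/π is the usual small-damping relation, assumed here because equation (25) is not reproduced in this section.

```python
import math


def log_decrement(amplitudes):
    """Slope of ln(amplitude) against cycle number, with the sign reversed.

    A least-squares fit over all readings is less sensitive to reading
    errors than the ratio of two successive amplitudes.
    """
    n = len(amplitudes)
    xs = list(range(n))
    ys = [math.log(a) for a in amplitudes]
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    return -sxy / sxx


# Synthetic scale readings (arbitrary units) for an assumed decrement of 0.01.
true_delta = 0.01
readings = [100.0 * math.exp(-true_delta * k) for k in range(50)]

delta = log_decrement(readings)
phi = delta / math.pi  # assumed small-damping relation between delta and phi
print("logarithmic decrement:", delta)
print("internal friction:    ", phi)
```

If the fitted slope changes along the decay curve, the damping is amplitude dependent, and the fit should then be made over short stretches of the record.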

The torsion pendulum method is the oldest and simplest method for 
measuring internal friction. This method has a long history, dating 
back to COULOMB at the end of the eighteenth century, and having 
recently come into wide use in such modifications as those of 
CHEVENARD 31 and BOULANGER 32 (the Coulomb micropendulum),
KE, 33 and FINE. 34 The torsion pendulum of KE has been imitated by
several authors and used in most of the recent measurements at low
frequencies. It requires a specimen in wire form, usually 6 to 12 in. long
and about 0.03 in. in diameter. The Coulomb micropendulum, on the
other hand, uses a specimen about \ in. long and 0.06 in. in diameter.
Automatic recording methods have been used by several experimenters. 32, 34, 35

The strain amplitude in a cylindrical specimen of circular cross 
section, vibrating in a torsion pendulum, is variable, from a maximum 
value at the surface to zero at the cylinder axis. The maximum 
amplitude of shear strain in the specimen is usually of the order 10⁻⁵.
The torsion pendulum is particularly useful for large values of damping
up to values of φ of the order 0.1, while the lower limit is not much
below 10⁻⁴, even when the chamber is evacuated to eliminate air
damping effects. 28 The study of materials that show very low damping,
of the order 10⁻⁵, has not been possible, thus far, with the torsion
pendulum because of the background damping due to losses in the 
apparatus. 
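The strain figures quoted above may be related to observable quantities by the elementary torsion formula for a circular cylinder, in which the surface shear strain is rθ/L for a twist θ over a length L. The sketch below uses that textbook relation, which is not stated in the text, together with the small-angle optical-lever relation. The scale distance of 1 m is an assumed value.

```python
def surface_shear_strain(twist_rad, length, diameter):
    """Maximum (surface) shear strain of a twisted circular wire: r*theta/L."""
    return 0.5 * diameter * twist_rad / length


def twist_for_strain(strain, length, diameter):
    """Twist angle in radians that gives the stated surface shear strain."""
    return strain * length / (0.5 * diameter)


def spot_deflection(twist_rad, scale_distance):
    """Small-angle displacement of the light spot on the scale.

    The reflected beam turns through twice the rotation of the mirror.
    """
    return 2.0 * twist_rad * scale_distance


# Wire of the size quoted for the torsion pendulum: 12 in. long, 0.03 in. diameter.
theta = twist_for_strain(1.0e-5, length=12.0, diameter=0.03)
print("twist for a surface strain of 1e-5 (rad):", theta)
# Assumed scale distance of 1 m; the result is in metres.
print("spot deflection (m):", spot_deflection(theta, scale_distance=1.0))
```

On these assumptions a surface strain of 10⁻⁵ corresponds to a deflection of the order of a centimetre, which is readily read on a scale.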

It is possible to obtain the amplitude dependence of internal friction 
from a decay curve in free vibration, as already pointed out in the last 
section. The dependence of vibration frequency on amplitude is much 
more difficult to determine by this method. To obtain the dependence 
of internal friction on temperature, the specimen is placed inside a 
vertical tubular furnace, preferably noninductively wound. 

High Frequency Methods 

These methods are usually characterized by the absence of an auxiliary 
inertia member ; the specimen, in the form of a rod, is usually vibrated 
in one of its normal modes. Both free decay and steady-state forced 
vibrations may be studied. The frequency range varies depending on 
the type of vibration produced. For example, in torsional or longitudinal 
modes of vibration of cylindrical specimens, the frequency, which 
depends primarily on the length of specimen, usually ranges from 10⁴
to 10⁶ cycles per second. Flexural vibrations of a rod, on the other
hand, usually are observed in the frequency range 300 to 10,000 cycles
per second, and the frequency depends on both the diameter and length
of the specimen.
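The frequency ranges quoted above can be illustrated with the standard textbook expressions for the fundamental modes of a uniform free-free rod. These formulas are not given in the text, and the sketch assumes round, steel-like material constants purely for illustration. The longitudinal and torsional fundamentals depend only on length, whereas the flexural fundamental also involves the diameter through the radius of gyration d/4 of a circular section.

```python
import math


def longitudinal_fundamental(length, youngs_modulus, density):
    """Fundamental longitudinal frequency (Hz) of a free-free rod."""
    return math.sqrt(youngs_modulus / density) / (2.0 * length)


def torsional_fundamental(length, shear_modulus, density):
    """Fundamental torsional frequency (Hz) of a free-free circular rod."""
    return math.sqrt(shear_modulus / density) / (2.0 * length)


def flexural_fundamental(length, diameter, youngs_modulus, density):
    """Fundamental flexural frequency (Hz) of a free-free circular rod."""
    beta_l = 4.730            # first root of cos(x) * cosh(x) = 1
    gyration = diameter / 4   # radius of gyration of a circular section
    return (beta_l ** 2 / (2.0 * math.pi * length ** 2)
            * gyration * math.sqrt(youngs_modulus / density))


# Assumed steel-like constants in SI units (illustrative only).
E = 2.0e11    # Young's modulus, Pa
G = 8.0e10    # shear modulus, Pa
RHO = 7.8e3   # density, kg/m^3

print("longitudinal, L = 10 cm:", longitudinal_fundamental(0.10, E, RHO))
print("torsional,    L = 10 cm:", torsional_fundamental(0.10, G, RHO))
print("flexural, L = 10 cm, d = 5 mm:", flexural_fundamental(0.10, 0.005, E, RHO))
```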

There are a wide variety of methods that have been used for the 
excitation and detection of vibrations at high frequencies. In most of 
them the power source is an electrical oscillator. The methods generally 
differ in the manner by which electrical oscillations are converted into 
the mechanical vibrations of the specimen. Detection consists of a
means for reconverting mechanical into electrical vibrations with subsequent
amplification. Most investigators have employed parallel
schemes for excitation and detection, but in some cases completely
different methods have been used for these two purposes. The most
common high frequency methods, which are illustrated schematically in
Fig. 5 (taken from SIEGEL 36), are as follows.

Fig. 5. Schematic diagrams of the apparatus for various high
frequency methods of measurement of internal friction

(a) The electrostatic method 37-39 treats one end of the specimen as
one plate of a condenser, applying an alternating potential difference
between the specimen and an adjacent fixed plate to excite mechanical
vibration. Detection of the vibration may be achieved by forming a
similar condenser at the other end which is used as a condenser microphone.
Frequency modulation has also been used 39 as a method of
detection. If the specimen is not a conductor, the electrostatic method
may still be used if a metal foil is cemented to the specimen. 40 The
electrostatic method is suitable for longitudinal and flexural vibrations ; 
it may also be used for torsion 37 if electrodes are attached to the 
specimen in an appropriate manner. 

(b) The eddy-current method 20, 41 is limited to longitudinal vibration
of specimens that are good conductors. In order to drive the specimen, 
one end of it is surrounded by a coil and placed in the field of a per- 
manent magnet. An alternating current from an oscillator through the 
coil induces eddy currents in the specimen which, in turn, interact 
with the magnetic field to set the specimen into vibration. If a similar 
coil and magnetic field are located at the other end of the specimen, 
the motion of the specimen in this magnetic field induces eddy currents 
in the specimen. These currents, in turn, induce an electromotive force 
in the coil, which may then be amplified. The amplitude of vibration 
produced in the specimen is very small, since the electromechanical 
coupling is weak. The method is, therefore, only suitable for the study 
of very low internal friction, where not too much power is needed to 
drive the specimen. Adequate shielding is required to prevent direct 
coupling between the detector and input coils. The chief advantage of 
the method is the fact that nothing is attached to the specimen which
might produce internal strains and, therefore, additional spurious 
contributions to the damping. 

(c) The magnetic method 42-44 requires that pole pieces (discs of
ferromagnetic material) be attached to the specimen unless it is itself 
ferromagnetic. A coil, placed close to one of the discs is connected to 
the oscillator and the resulting alternating magnetic field interacts with 
this pole piece to drive the specimen. A second coil near the other disc 
develops an induced e.m.f. which is amplified and detected. The 
positioning of the pole pieces and coils determines the mode of vibration ; 
the method may be used for longitudinal, flexural, and even for tor- 
sional 42 vibration by appropriate arrangement of the pole pieces. An 
additional advantage of the method is that it may be used for materials 
that are not electrical conductors. It is particularly useful for the study 
of large damping, because of the relatively strong electromechanical 
coupling. On the other hand, the possibility that the cementing of pole 
pieces to the specimen produces internal strains which contribute to the 
damping, means that this method is not especially good for cases of 
very small damping. An interesting variation of this method is that of
FÖRSTER, 45 in which the two discs are not directly cemented to the
specimen but attached to it through the supporting wires. The specimen
is then driven by the oscillator through one wire and the vibrations
detected through the other. The method is particularly useful for
measurements at high temperatures, since the supporting wires may be
brought outside the furnace so that all the electrical equipment is at
room temperature.

(d) The piezoelectric method of QUIMBY 46, 47 utilizes a quartz crystal,
to which the specimen is cemented. Application of an alternating 
potential difference across the quartz sets the two cylinders into 
vibration as a single resonant body, or a "composite oscillator." 
Longitudinal and torsional vibrations are produced respectively, by 
means of an X-cut and a Y-cut quartz crystal. The positioning of the 
electrodes for these two cases is shown in Fig. 5d. The equivalent 
electrical circuit of this composite oscillator at the resonant frequency 
depends on the elastic and dissipative properties of both the quartz 
and the specimen, but the contributions due to the quartz crystal may 
be determined separately and taken into account. In particular, the 
equivalent resistance of the circuit, which may be measured in an a.c. 
bridge, 48 gives a composite internal friction from which that of the 
quartz alone may be subtracted. The contribution of the adhesive 
between quartz and specimen to the measured damping is small if the 
natural resonant frequency of the specimen is closely matched to that 
of the quartz crystal, so that a node of stress occurs at the junction. 
A range of values of maximum strain amplitude from about 10⁻⁸ to
10⁻⁵ may be obtained. READ 9 has shown how this amplitude is calculated
from measurable quantities. This method has been the most
important one for the measurement of amplitude dependent internal 
friction, because the internal friction at any amplitude may be obtained 
directly from the equivalent resistance at resonance, once the propor- 
tionality constant relating these quantities is known. The specimen 
need not be an electrical conductor in this piezoelectric method, but 
the requirement of close matching in the individual resonant frequencies 
of the quartz and specimen is an important limitation. In addition, it 
had been found 49 that strains at the junction resulting from differences 
in thermal expansion of the quartz and specimen, made significant 
contributions to the damping, and thus defeated attempts to measure 
internal friction as a function of temperature by this method. This 
difficulty is resolved 11 by the use of a liquid adhesive film ; however, the 
handling of the composite oscillator becomes more difficult when a 
liquid adhesive is used. 

In all the high frequency methods, the external contribution to the 
damping is limited only to losses in the supports, if the system is 
evacuated. The contribution to the damping resulting from the 
presence of air in the system is usually of the order 10⁻⁴; that due to
the supports can be made considerably lower if the supports are located
at displacement nodes. Thus, values of φ as low as 2 × 10⁻⁶ have been
reported 50 in evacuated systems.

The high frequency methods are suitable for the study of either free
or forced vibrations. If the damping is sufficiently small and frequency
not too high (i.e. not too large a decay in 1 sec), it is often possible
to measure directly the time of free decay to a particular fraction of
the initial amplitude. More generally, electrical circuits may be used 51-54
to time the reduction of amplitude by any predetermined factor. In 
contrast to the damping in free decay, which becomes easier to measure 
for low damping, the resonance curve in forced vibrations becomes 
more difficult to measure as damping gets very small, because of the 
sharpness of resonance. Electrical oscillators can be constructed of 
such stability and with sufficiently fine frequency controls, however, 
that a resonance curve for which Q⁻¹ (equation (23)) is 10⁻⁵ may be
measured without difficulty. It is only necessary that one be able to
read frequency variations of better than one part in 10⁶ to obtain such a
resonance curve; the frequency itself need only be known to about
1 per cent. 
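The two requirements discussed above, a decay slow enough to be timed and a frequency control fine enough to trace the resonance curve, can be put into numbers. The sketch assumes the usual small-damping relations, namely that Q⁻¹ is the fractional width of the resonance curve between half-power points and that the free amplitude decays as exp(−πQ⁻¹ft). Equation (23) is not reproduced in this section, so these forms are assumptions. The resonant frequency of 30,000 cycles per second is hypothetical.

```python
import math


def half_power_width(f_res, q_inv):
    """Width (Hz) of the resonance curve between half-power points."""
    return q_inv * f_res


def decay_time(f_res, q_inv, fraction):
    """Time (sec) for a free vibration to fall to `fraction` of its amplitude.

    Assumes the amplitude decays as exp(-pi * q_inv * f_res * t).
    """
    return math.log(1.0 / fraction) / (math.pi * q_inv * f_res)


F_RES = 3.0e4    # hypothetical resonant frequency, cycles per second
Q_INV = 1.0e-5   # damping of the order discussed in the text

width = half_power_width(F_RES, Q_INV)
print("half-power width (Hz):", width)
# Ten readings across the width call for steps of about one part in 10^6.
print("fractional step for ten points:", width / 10.0 / F_RES)
print("time to fall to half amplitude (sec):", decay_time(F_RES, Q_INV, 0.5))
```

The same numbers show why the two techniques are complementary: as Q⁻¹ decreases the decay time lengthens and becomes easier to time, while the resonance curve narrows in proportion.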

A method that utilizes frequencies in the megacycle range is the 
pulse technique, by which a short ultrasonic pulse is produced at one 
end of the specimen and its time of transit through the specimen and 
attenuation are measured. Although this method has been used successfully
for the measurement of elastic constants, 55, 56 the measured
attenuation may be in large part due to scattering by imperfections
rather than to absorption, because of the short wave lengths employed. 
This method, therefore, has not as yet been used for quantitative 
measurements of internal friction. 

STRESS-INDUCED ORDERING : A SOURCE OF INTERNAL FRICTION 

A discussion of order concerns itself with the distribution of different 
atomic species in a solid solution on various lattice sites. If there is any 
deviation from a completely random distribution in the manner in 
which lattice sites are occupied, the alloy is said to show a degree of 
order. The concepts of long range and short range order that have been 
so widely discussed, 57 are special cases of this definition. The applica- 
tion of stress to a system may result in a change in the state of order, 
if one already exists, or it may produce a state of order in an initially 
disordered solid solution. For stress to have such an effect, it is simply 
required that a particular change in the state of order result in the 
production of strain. This requirement is simply a consequence of the 
thermodynamic reciprocity relations. For example, in the study of 
piezoelectricity, it is well known 58 that if the application of stress 
produces electrical charges on particular surfaces of a crystal, then a 
voltage across these surfaces will produce strain (electrostriction). It 
will be of value to derive the reciprocity relation that applies to the 
change of order induced by stress. Let p represent a parameter that is a 
measure of the state of order. Deviation from randomness means the
existence of an energy term that favours ordering. Let u represent this
ordering energy: the energy decrease per unit volume, per unit increase
in p at constant strain (defined so that u > 0), or

$$u = -(\partial U/\partial p)_\varepsilon \qquad (50)$$



where U is the total energy of the system per unit volume. If we anticipate
that u is not appreciably temperature dependent, we may estimate
its value by considerations at absolute zero, where

$$dU = \sigma\,d\varepsilon - u\,dp$$

The differential of U − σε is, therefore, −ε dσ − u dp, and since it is a
perfect differential, the reciprocity relation

$$\left(\frac{\partial u}{\partial \sigma}\right)_p = \left(\frac{\partial \varepsilon}{\partial p}\right)_\sigma \qquad (51)$$


is valid. The meaning of this relation is that if a change of order at 
constant stress produces strain, the application of a stress will increase 
the ordering energy and will, therefore, shift the equilibrium in the 
direction of a greater degree of order. 

In this section three examples of such stress-induced ordering will be 
discussed. In the first two cases the state of the material for zero 
stress is one of randomness or disorder ; in the third example, we will 
consider initially ordered materials. In all cases, it is expected that the 
behaviour of the material under stress will correspond very closely to 
that of the standard linear solid, since the three conditions that define 
such a solid are valid to a good approximation. First, it is reasonable to 
assume that the non-elastic strain that accompanies a small change in 
p is proportional to this change. Equation (34) is, therefore, valid if 
p is redefined as the difference between the state of order existing and 
that at zero stress, so as to obtain a direct proportionality between
ε″ and p instead of a linear relationship. Furthermore, the change in
the equilibrium state of order under stress is proportional to the change 
of ordering energy, u, which is, in turn, proportional to the stress (again 
assuming small changes). Finally, the rate of approach to equilibrium 
is determined by the difference between atomic jumps that lead to 
greater and to lesser degrees of order per unit time, which is in turn, 
proportional to the deviation of p from its equilibrium value. (For p
equal to its equilibrium value p̄, the rate of occurrence of atomic jumps
that act to increase p and those that act to decrease p must be equal.)
Equation (35) is, therefore, valid. The mechanical properties of a 
material in which stress-induced ordering occurs will, therefore, be 
those of a standard linear solid. In particular, measurement of internal 
friction as a function of vibration frequency will show the characteristic 
peak whose equation is given by (39). The time of relaxation τ
associated with stress-induced ordering must bear a simple relation to
the mean frequency of atomic jumps, since the mechanism of the
change of the state of order must be the same as that for volume
diffusion. It follows that

$$1/\tau = \alpha\Gamma \qquad (52)$$

where Γ is the appropriate atom jump frequency and α a dimensionless
constant. Inasmuch as a change in the state of order of a material can
be accomplished by a small number of atomic jumps, the magnitude of α
is of the order of unity. It therefore appears that by simply locating
an internal friction peak, which determines τ according to the relation
ωτ = 1, it is possible to measure atomic jump frequencies in appropriate
alloys.
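The step from a measured peak to a jump frequency can be sketched as follows. The peak shape used is the familiar single-relaxation form φ = Δ ωτ/(1 + ω²τ²), which is assumed here to be what equation (39) expresses, since that equation is not reproduced in this section. The relaxation strength and relaxation time are hypothetical values, and α is left as a parameter of order unity.

```python
import math


def debye_peak(omega, tau, strength):
    """Internal friction of a single relaxation (assumed form of the peak)."""
    x = omega * tau
    return strength * x / (1.0 + x * x)


def jump_frequency(peak_freq_hz, alpha=1.0):
    """Jump frequency from 1/tau = alpha * Gamma, with omega * tau = 1 at the peak."""
    tau = 1.0 / (2.0 * math.pi * peak_freq_hz)
    return 1.0 / (alpha * tau)


# Hypothetical relaxation: strength 0.02 and tau chosen to put the peak at 1 c/s.
STRENGTH = 0.02
TAU = 1.0 / (2.0 * math.pi)

# Scan the vibration frequency on a logarithmic grid and locate the maximum.
freqs = [10.0 ** (k / 100.0) for k in range(-200, 201)]
peak_freq = max(freqs, key=lambda f: debye_peak(2.0 * math.pi * f, TAU, STRENGTH))

print("peak frequency (c/s):", peak_freq)
print("peak height:", debye_peak(2.0 * math.pi * peak_freq, TAU, STRENGTH))
print("jump frequency for alpha = 1 (per sec):", jump_frequency(peak_freq))
```

The height of the maximum is one half of the relaxation strength, and its position fixes τ without any knowledge of that strength.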

The application of internal friction measurements to the study of
diffusion phenomena is based on equation (52). Let us contrast the
internal friction methods to conventional diffusion measurements. In
the latter case, there must be a concentration gradient initially present
in the specimen, and one obtains the change in the concentration-versus-distance
curves as a function of time. From these curves a diffusion coefficient
may be obtained, which is related to the appropriate atomic
jump frequency by the equation

$$D = \beta a^2\Gamma \qquad (53)$$

where a is the lattice parameter, and β a dimensionless constant which
depends on the geometry of the lattice structure and is of the order of
magnitude unity. Since diffusion is a random walk problem, the mean
distance that an atom travels from its starting position is proportional
to t^(1/2) (t = time) and, therefore, to the square root of the number of
atomic jumps. Thus, in order that diffusion takes place, even over
distances of the order of 0.1 mm, a mean number of atom jumps of
about 10¹⁰ to 10¹¹ is required. If such a large number of jumps is to
occur within the time limit of an experiment, the jump frequency must
be very great. The magnitude of Γ and, therefore, of D can be controlled
by selecting the appropriate temperature, since both quantities obey
an Arrhenius equation,

$$\left.\begin{aligned} D &= D_0\,e^{-H/RT} \\ \Gamma &= \Gamma_0\,e^{-H/RT} \end{aligned}\right\} \qquad (54)$$

where H is called the heat of activation for diffusion. Conventional 
diffusion measurements are, therefore, limited to high temperatures. 
For example, the temperature at which the jump frequency is 1 sec⁻¹
is far too low for diffusion measurements, since the experiment requires
a time of at least 10¹⁰ sec or 300 years! On the other hand, such a jump
frequency can be measured very conveniently by internal friction 
methods when the period of vibration is of the order of 1 sec. The
restriction of conventional diffusion measurements to high temperatures
has led to the following limitations: (1) only a small range of values of
D can be measured (usually D can be obtained only over two to three
powers of ten); (2) when diffusion is due to the movement of vacancies,
it is not possible to learn more about the process by trapping non- 
equilibrium numbers of vacancies at low temperatures and detecting 
their presence by the abnormally rapid atomic movement that they 
produce ; (3) it is not possible to study the effect of dislocations (cold 
work) on diffusion because experiments are conducted above the 
recrystallization temperature ; (4) precipitation cannot be studied by 
direct measurement of the rate of atomic movement in supersaturated 
solid solutions. The use of internal friction methods for the study of 
diffusion phenomena has already contributed towards getting these
investigations out of the strait jacket of high temperature measurements. 
In those cases in which internal friction methods are applicable, con- 
tributions have been made toward the elimination of each of the 
limitations listed above. 
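The order-of-magnitude argument that confines conventional diffusion measurements to high temperatures can be reproduced with a few lines of arithmetic. The sketch below takes the number of jumps as the square of the distance measured in lattice parameters, with the proportionality constant set to unity. The lattice parameter of 3 × 10⁻⁸ cm is an assumed typical value, not a figure from the text.

```python
SECONDS_PER_YEAR = 3.156e7


def jumps_needed(distance_cm, lattice_parameter_cm):
    """Random-walk estimate: number of jumps ~ (distance / jump length)**2."""
    return (distance_cm / lattice_parameter_cm) ** 2


def years_required(n_jumps, jump_frequency_per_sec):
    """Duration of the experiment, in years, at the stated jump frequency."""
    return n_jumps / jump_frequency_per_sec / SECONDS_PER_YEAR


# 0.1 mm = 0.01 cm; the lattice parameter is an assumed typical value.
n = jumps_needed(0.01, 3.0e-8)
print("jumps needed to diffuse over 0.1 mm:", n)
print("years at one jump per second for 1e10 jumps:", years_required(1.0e10, 1.0))
```

An internal friction measurement at a period of about 1 sec, by contrast, responds to a jump frequency of this order directly, since only a few jumps per atom are involved.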

Examples of Interstitial Solutions in B.C.C. Lattices 

The study of stress-induced ordering by internal friction methods has
been most extensive in the case of interstitial solid solutions in b.c.c.
(body-centred cubic) lattices. Most investigations have been concerned
with carbon and nitrogen as interstitials in α-iron; in fact, the internal
friction effects of this general type were first discovered and explained
for the case of α-iron. In early work 59 it was observed that the damping
of steel tuning forks goes through a maximum as a function of temperature
and that the location of this maximum is dependent upon the
frequency of the fork. SNOEK 60 showed that interstitial solutes were
essential to this damping peak by demonstrating that the effect no
longer occurred when all traces of carbon and nitrogen were removed.
He also first explained 61 the origin of the effect as follows: interstitial
carbon and nitrogen in b.c.c. iron are not located in the largest spaces,
but at the centres of the cell edges, i.e. positions of the type (½, 0, 0),
and at the face centres, which are equivalent positions. An interstitial
atom located in one of these positions will produce a distortion that has
tetragonal symmetry, i.e. most of the distortion is in the direction of the
two closest iron atoms (one of the ⟨100⟩ directions). If the directions
of the cube axes are designated as x, y and z, respectively, the interstitial
sites may be designated as x, y or z sites according to the direction
in which maximum distortion is produced. Under zero stress, the three 
kinds of interstitial sites are occupied in a random fashion, in accordance 
with the concentration of interstitial atoms ; each type of site, there- 
fore, has one-third of the total number of interstitials. Since an excess 
of atoms in any one type of site would result in an elongation in the
corresponding direction, our reciprocity equation (51) requires that the
application of a tensile stress in the z direction, for example, will lower 
the energy for atoms in z sites as against x and y sites. Such a tensile 
stress will, therefore, produce a redistribution of atoms such that the 
z sites are occupied preferentially. A state of order is thus induced by 
stress where a completely random distribution existed for zero stress. 

The atomic description given by SNOEK to explain the internal
friction peak observed in α-iron is so complete that a calculation of the
anelastic parameters Δ_M and τ in terms of physical constants is readily
carried out in terms of this description. It is convenient to define the
parameter p as

$$p = n_z - n/3 \qquad (55)$$

where n is the number of interstitials and n_z the number in z sites, per
unit volume. The non-elastic strain in the z direction is given by
equation (34), and its equilibrium value at any stress, by equation (36),
where M is now replaced by Young's modulus in the ⟨100⟩ direction,
which we will call E. The quantity u, equation (50), is now the decrease
in energy when one interstitial is brought from an x or y site to a z site.
When equations (34) and (36) are combined, we obtain

$$\Delta_E = \lambda E\,\bar{p}/\sigma \qquad (56)$$

In order to complete the derivation of Δ_E and to obtain the relaxation
time τ, it is necessary to calculate the net rate of change of n_z. This
quantity is given by

$$\dot{n}_z = \Gamma_{yz}n_y + \Gamma_{xz}n_x - (\Gamma_{zx} + \Gamma_{zy})n_z \qquad (57)$$

where n_y and n_x are defined analogously to n_z, Γ_yz is the rate (or
probability per second) of transitions from y sites to z sites, and similar
definitions apply to Γ_xz, Γ_zx, and Γ_zy. A consideration of the geometry
of the lattice shows that, if all jumps of interstitials occur between sites
that are nearest neighbours, then an atom in a z site may move only into
four positions of which two are x and two y sites (similarly for
other transitions). Furthermore, the activation energy is no longer the
same for jumps into and out of z positions. Fig. 6 shows that if the
height of the barrier under zero stress is H, in accordance with equation
(54), then the application of stress must raise the barrier for z → x
transitions by u/2, and lower the barrier for x → z transitions by this
same amount; the energy difference between the z and x states is u,
as already mentioned. These arguments lead to the conclusion that

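$$\Gamma_{yz} = \Gamma_{xz} = \tfrac{1}{2}\Gamma_0\,e^{-(H - u/2)/kT} \approx \tfrac{1}{2}\Gamma\left(1 + \frac{u}{2kT}\right)$$

and

$$\Gamma_{zx} = \Gamma_{zy} = \tfrac{1}{2}\Gamma_0\,e^{-(H + u/2)/kT} \approx \tfrac{1}{2}\Gamma\left(1 - \frac{u}{2kT}\right)$$

where Γ is the rate of jump of interstitials under zero stress, and
u/kT ≪ 1 because the applied stress is assumed small. These terms may
now be inserted into equation (57) to obtain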



$$\dot{n}_z = -\tfrac{3}{2}\Gamma\left[\left(n_z - \frac{n}{3}\right) - \frac{2}{9}\,\frac{un}{kT}\right] \qquad (58)$$

Fig. 6. The effect of a tensile stress in the z direction on the energy
barrier for the jump of interstitial atoms between x and z sites

where n_x + n_y has been set equal to n − n_z, and the term un_z is approximately
equal to un/3, since u is already small, of first order. Comparison
of equation (58) with equations (35) and (55) shows that

$$\tau^{-1} = \tfrac{3}{2}\Gamma \qquad (59)$$

(or, in terms of equation (52), α = 3/2), and

$$\bar{p} = \tfrac{2}{9}\,un/kT \qquad (60)$$
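The results (59) and (60) can be verified by integrating the rate equation (57) directly. The sketch below is an illustration under stated assumptions rather than part of the original derivation: each transition rate is taken as ½Γ(1 ± u/2kT), which is the linearized form consistent with α = 3/2, the units are chosen so that n = 1 and Γ = 1, and the value u/kT = 0.01 is hypothetical.

```python
import math


def simulate_p(n=1.0, gamma=1.0, x=0.01, dt=1.0e-3, t_end=20.0):
    """Integrate equation (57) by Euler steps and return times and p = n_z - n/3.

    x stands for u/kT. The rates are the assumed linearized forms
    0.5 * gamma * (1 +/- x/2) for jumps into and out of z sites.
    """
    rate_in = 0.5 * gamma * (1.0 + 0.5 * x)    # x -> z and y -> z
    rate_out = 0.5 * gamma * (1.0 - 0.5 * x)   # z -> x and z -> y
    nz = n / 3.0                               # random distribution at t = 0
    times, p = [0.0], [0.0]
    for k in range(1, int(round(t_end / dt)) + 1):
        # n_x + n_y = n - n_z, and an atom in a z site has two exits of each kind.
        nz += (rate_in * (n - nz) - 2.0 * rate_out * nz) * dt
        times.append(k * dt)
        p.append(nz - n / 3.0)
    return times, p


times, p = simulate_p()
p_equilibrium = p[-1]

# Relaxation time: the time at which p has covered the fraction 1 - 1/e of its change.
target = p_equilibrium * (1.0 - math.exp(-1.0))
tau = next(t for t, value in zip(times, p) if value >= target)

print("relaxation time:", tau, " expected 2/(3*Gamma) =", 2.0 / 3.0)
print("equilibrium p:  ", p_equilibrium, " expected (2/9)*x*n =", 2.0 * 0.01 / 9.0)
```

The small residual differences are of second order in u/kT and in the step length, in keeping with the first-order approximation made in passing from (57) to (58).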

If u is assumed to depend only on σ and not on p (this assumption
means that mutual interaction between interstitials is not important,
and should certainly be valid for the low concentration of interstitial
solutes usually encountered) then, from the reciprocity equation (51),
u = λσ, where λ is defined by equation (34). When this result is substituted
into equation (60) and the resulting expression for p̄ inserted
into (56), we obtain the final expression for Δ_E:

$$\Delta_E = \frac{2}{9}\,\frac{\lambda^2 E\,n}{kT} \qquad (61)$$



The relaxation strength Δ_E is, therefore, proportional to the solute
concentration and varies inversely as the absolute temperature. To
calculate Δ_E requires only a knowledge of λ (= dε/dp), which can be
estimated 62 for the important case of carbon in α-iron from the change
in lattice parameters of tetragonal martensite as a function of carbon
concentration. The constant that relates Δ_E to n/T can also be determined
experimentally from the height of the internal friction peak for a
known concentration of solute. The value of this constant is, however,
dependent on the crystallographic direction of application of the tensile
stress. The above derivation applies to a stress in the ⟨100⟩ direction;
if the stress were in the ⟨111⟩ direction, SNOEK'S theory predicts
that Δ_E = 0, since all three tetragonal axes are then equally inclined
to the direction of application of stress. The verification of this predicted
result by DIJKSTRA, 63 who found, in single crystals of α-iron,
that Δ_E for the ⟨111⟩ direction is negligible compared to its value in
the ⟨100⟩ direction, is the most striking evidence of the correctness of
SNOEK'S picture of the relaxation process.* The value of the constant
term in the expression (61) for Δ_E has been obtained experimentally,
for carbon and nitrogen in polycrystalline α-iron, by DIJKSTRA; it was
found that, for internal friction peaks close to room temperature, the
maximum damping (which is equal to Δ_E/2) is very closely equal to the
weight per cent of interstitial solute. Because of the strong orientation
dependence of Δ_E, already discussed, it should be remembered that any
texture in the specimens used will lead to errors in the constant that
relates the height of the peak to the solute concentration. On the other
hand, relative values of the height of the peak, which may be studied as
a function of heat treatment, give a precise measure of the relative
residual concentration of interstitial atoms in solution at any time.
This is the basis for the study of precipitation which will be described 
below. 
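Because the peak height is proportional to the concentration of solute still in solution, a series of peak heights measured after successive heat treatments converts directly into the fraction remaining in solution and the fraction precipitated. A minimal sketch follows. The tempering times and peak heights are invented for illustration and are not measured data.

```python
def fraction_in_solution(peak_heights):
    """Ratio of each peak height to that of the as-quenched specimen."""
    initial = peak_heights[0]
    return [h / initial for h in peak_heights]


def fraction_precipitated(peak_heights):
    """Complement of the fraction of solute still in solid solution."""
    return [1.0 - r for r in fraction_in_solution(peak_heights)]


# Hypothetical peak heights after successive tempering treatments.
tempering_hours = [0, 1, 3, 10, 30]
heights = [0.0200, 0.0170, 0.0120, 0.0060, 0.0020]

for hours, frac in zip(tempering_hours, fraction_precipitated(heights)):
    print(hours, "h tempering: fraction precipitated =", round(frac, 3))
```

Only ratios of peak heights enter, so the uncertainty in the absolute constant caused by texture does not affect the result, provided the same specimen is used throughout.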

Although the height of the internal friction peak is sensitive to orienta- 
tion, the temperature of maximum internal friction for a given fre- 
quency, i.e. the location of the peak, is independent of both orientation 
and the concentration of solute, but is a measure only of the jump rate
of interstitials at that temperature (equation (59)). 
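Since the location of the peak depends only on the jump rate, and the jump rate obeys the Arrhenius equation (54), the shift of the peak temperature with vibration frequency yields the heat of activation. The sketch below generates self-consistent peak temperatures from assumed parameters and then recovers H from them. The values of H and τ₀ are hypothetical and serve only to exercise the formula.

```python
import math

R = 1.987  # gas constant, cal/(mol K)


def peak_temperature(freq_hz, heat_h, tau0):
    """Temperature (K) at which omega * tau = 1 when tau = tau0 * exp(H/RT)."""
    return heat_h / (R * math.log(1.0 / (2.0 * math.pi * freq_hz * tau0)))


def heat_of_activation(f1, t1, f2, t2):
    """H (cal/mol) from peak temperatures t1, t2 (K) at frequencies f1, f2."""
    return R * math.log(f2 / f1) / (1.0 / t1 - 1.0 / t2)


# Hypothetical parameters, used only to generate self-consistent data.
H_TRUE = 20000.0   # cal/mol
TAU0 = 1.0e-15     # sec

t_low = peak_temperature(1.0, H_TRUE, TAU0)       # torsion pendulum range
t_high = peak_temperature(1000.0, H_TRUE, TAU0)   # flexural vibration range

print("peak temperature at 1 c/s (K):   ", t_low)
print("peak temperature at 1000 c/s (K):", t_high)
print("recovered H (cal/mol):", heat_of_activation(1.0, t_low, 1000.0, t_high))
```

Combining a low frequency method with a high frequency one widens the temperature interval, and hence improves the precision with which H is obtained.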

The generality of the SNOEK mechanism has been demonstrated by 
the observation of effects, in other b.c.c. lattices containing interstitial 
solutes, similar to those in α-iron. Thus KE investigated relaxation
effects due to carbon and oxygen, 64 and also to nitrogen, 65 in tantalum. 
In the case of oxygen the observed internal friction peak is considerably 
broader than that corresponding to a single time of relaxation. It was, 
therefore, suggested that oxygen interstitials in tantalum may be 
located in two different kinds of positions, so that two types of jumps are 
possible. Further investigation of carbon in tantalum and also in 
columbium has been carried out by WERT. 66 In the nonmetallic 
category, internal friction peaks observed 67 in glass have been attributed 
to the diffusion of sodium and potassium ions to preferential positions
in the glass structure, in very much the same way as in the case of
metals. 

The internal friction peak originating in the stress-induced ordering 
of interstitial solutes has been utilized for extensive investigations of 
precipitation from supersaturated solutions, especially in the case of 
α-iron. It has already been shown that Δ_E is proportional to n, the
concentration of interstitials in solution. This is not the same as the 
total concentration of solute present. The solute is usually introduced 
into the lattice by a high temperature treatment in an appropriate 
chemical atmosphere. At this high temperature, the solubility is usually 
considerably greater than at lower temperatures, so that if equilibrium 
is to be maintained, precipitation of a second phase rich in solute must 
take place at lower temperatures. The solute thus precipitated is 
usually tightly bound, in the form of an intermetallic compound, and 
cannot contribute to the internal friction. The height of the peak is, 
therefore, a measure only of the concentration of interstitial atoms in 
solid solution, and decreases as precipitation takes place. This suggests 
a unique method for following the precipitation, since the actual amount 
of precipitate can be obtained quantitatively as a function of ageing 
time and temperature. No other physical property that has been used 
to follow the course of precipitation (electrical resistivity, thermoelectric 
effect, magnetic properties, density, and microscopic investigation) is 
determined so completely by the quantity of precipitate, and is indepen- 
dent of the condition or size of precipitate particles. This method was 
first applied by DIJKSTRA 68 to the study of the precipitation of nitrogen 
and carbon in α-iron. The internal friction peak for nitrogen is at 20°C, 
and for carbon at 36°C, when the vibration frequency is about 1 sec⁻¹. 
The rate of precipitation is negligible at these low temperatures, so that 
a specimen is first quenched from the temperature at which carbon or 
nitrogen is introduced and then tempered for various periods at the 
desired precipitation temperature. After each heat treatment the 
specimen (in wire form) is quenched to prevent further precipitation 
and the height of the peak determined in a torsion pendulum. In this 
way, one can obtain the amount of solute left in solution and, therefore, the 
amount precipitated, as a function of tempering time at a given tem- 
perature. To obtain a similar tempering curve at a second temperature, 
it is simply necessary to give the original specimen a re-solution heat 
treatment and quench. Using this method, DIJKSTRA 68 finds that the 
precipitation of nitrogen in α-iron occurs in two successive stages. In 
the first, plates of an unknown precipitate (called "N phase 1") form 
on (100) planes of the matrix (determined by microscopic examination) 
and in the second, Fe₄N plates form. It is also found that precipitation 
is much faster in cold worked α-iron and that the same type of 
precipitate is no longer formed. This study of precipitation in iron was 
continued by WERT 69 and a quantitative interpretation given to the 
measurements by ZENER. 70 It is shown that the amount of precipitate 
q would, in the initial stages of the process, follow the law 

$$q = b\,(t - t_0)^{r} \qquad (62)$$

where t₀ is the time for nucleation (time at which the nucleus is formed, 
so that growth occurs thereafter by direct diffusion of solute atoms to 
the nucleus). ZENER also shows that the value of r depends on the 
shape of the precipitate particles; for spherical particles r = 3/2, for 
cylindrical particles r = 2, and for circular discs r = 5/2. The results 
for precipitation of carbon in α-iron give r = 3/2, and for the first 
stage (N phase 1) of nitrogen precipitation, r = 5/2; the shape of 
particles in these two cases is, therefore, determined. In all cases t₀ is 
found to be zero within experimental error. Apparently nuclei are 
present as soon as the material is quenched or, if there is an incubation 
time, it is very short compared to the time for precipitation. 
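
The determination of the exponent r from a tempering curve can be sketched as a straight-line fit of log q against log (t − t₀). The example below is only an illustration under stated assumptions: the data are synthetic (generated from the power law itself with a made-up constant b), t₀ is taken as zero as reported above, and the helper names are hypothetical rather than anything used by the original authors.

```python
import math

def fit_power_law(times, amounts, t0=0.0):
    """Least-squares fit of log q = log b + r*log(t - t0); returns (b, r)."""
    xs = [math.log(t - t0) for t in times]
    ys = [math.log(q) for q in amounts]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    r = sxy / sxx
    b = math.exp(y_mean - r * x_mean)
    return b, r

# Exponents quoted in the text for the three particle shapes.
SHAPE_EXPONENTS = {"spheres": 1.5, "cylinders": 2.0, "discs": 2.5}

def nearest_shape(r):
    """Return the particle shape whose exponent lies closest to r."""
    return min(SHAPE_EXPONENTS, key=lambda name: abs(SHAPE_EXPONENTS[name] - r))

# Synthetic early-stage data (fraction precipitated versus time), illustration only.
times = [10.0, 20.0, 40.0, 80.0, 160.0]
amounts = [2.0e-4 * t ** 1.5 for t in times]

b, r = fit_power_law(times, amounts)
shape = nearest_shape(r)
```

With real measurements only the early part of the curve should be used in such a fit, since the simple power law applies to the initial stages of the process.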

The experimental curves deviate from the simple power law of 
equation (62) in later stages of the precipitation process, where com- 
petition between adjacent particles for the solute atoms in a given region 
of solution becomes appreciable. More exact laws are derived to take 
these interference effects into account. 71 ZENER also shows how the 
number of precipitate particles per unit volume, and, therefore, the 
mean spacing of particles, can be obtained from experimental values 
of the constant b of equation (62). 

As a part of the precipitation experiments, it is possible to determine 
the concentration of solute remaining in solution at any given tempera- 
ture after precipitation is complete, i.e. the solubility curve for the 
particular interstitial solute. Smaller solubilities can be determined by 
the internal friction method than those that one can usually determine 
by chemical methods. The solubility line for carbon in α-iron was 
determined 72 down to 150°C, where a solubility of only 10⁻⁴ weight per 
cent was observed. This value is lower than all previous estimates and 
more directly obtained. 

The use of the height of the internal friction peak in interstitial 
solutions to measure the degree of precipitation is extended to the study 
of precipitation of carbon and nitrogen in cold worked α-iron by 
HARPER. 73 For a sufficiently large amount of cold work (5 per cent 
extension is sufficient) normal precipitation, as studied by WERT, does 
not occur. Instead, an analysis by COTTRELL 74 and BILBY 75 suggests 
that precipitation of solute atoms into edge dislocations takes place. 
These authors calculate that such precipitation will follow a law of the 
form of equation (62) with r = 2/3. HARPER verified this power law 
experimentally, and with the aid of the formula of COTTRELL and 
BILBY, obtained the density of dislocations in the specimens for 
different degrees of cold work. Similar results were obtained for both 
carbon and nitrogen in α-iron. It is striking that these two cases, for 
which normal precipitation occurs according to equation (62) with two 
different r values, follow a 2/3 power law when precipitation occurs in 
a cold worked lattice. 

The final illustration of the versatility of SNOEK'S internal friction 
peak is its application to the study of diffusion. The potentialities of 
internal friction methods for such investigations have already been 
discussed in connexion with equation (52). In the case of interstitial 
solutes, we have derived equation (59), which enables us to determine 
atomic jump rates directly, and quantitatively, from measured relaxation 
times. The combination of low temperature internal friction 
measurements with high temperature conventional diffusion measurements 
makes possible the determination of D and Γ over a very wide 
range; H and D₀, or Γ₀ (equation (54)), may then be determined with 
great precision. It would seem that the atomic jump rate Γ is the most 
fundamental quantity with which we are here concerned, and that both 
the measured diffusion coefficients and relaxation times would be 
converted into the appropriate values of Γ by means of equations (53) and 
(59), respectively. The custom of reporting diffusion coefficients has 
been so widespread, however, that more often τ-values are converted 
into diffusion coefficients and compared directly with high temperature 
diffusion measurements. This focus of attention on D rather than Γ 
does not lead to any complications in interpretation, because of the 
simple proportionality between the two quantities. SNOEK 60, 61 reported 
values of D₀ and H for diffusion of carbon and nitrogen in α-iron, based 
on his measurements. More precise internal friction data were later 
reported by WERT and ZENER. The most complete data for carbon 
in α-iron are obtained 77 by the combination of static relaxation 
measurements with internal friction and conventional diffusion measurements, 78 
which give D₀ = 0.02 cm²/sec and H = 20,100 cal/mole, from a straight 
line plot of log D versus T⁻¹ that covers fourteen powers of ten in D 
over the temperature range −35°C to 800°C. It has been shown 76 that 
values of D₀ may be predicted theoretically if it is assumed that the free 
energy of activation, or the work required to bring an interstitial atom 
to the midpoint between two interstitial sites, goes entirely into elastic 
straining of the surrounding lattice. The value of D₀ calculated from 
this theory for the case of carbon in iron is in excellent agreement with 
the experimental result reported above. Earlier reported values of D₀ 
had not fitted the theory so well, because of the difficulty in a precise 
determination of D₀ when the data for D are only over a limited 
temperature range. (D₀ is obtained from the intercept at 1/T = 0 of the 
straight line plot of log D versus 1/T. Low temperature data, as 
obtainable by internal friction methods, are, therefore, very valuable in 
making possible a precise extrapolation of the straight line.) In other 
systems, particularly nitrogen in iron and carbon in tantalum, the 
agreement of reported values of D₀ with the theory has improved, 66 
as the range of measurement was extended. This verification of the 
strain theory of D₀, wherever precise data for interstitial diffusion are 
available, suggests that, in other interstitial solutions, the heat of 
activation can be obtained from only a single measurement of the 
diffusion coefficient D by the substitution of the calculated D₀ into 
equation (54). 
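
The figures quoted for carbon in α-iron can be checked with a few lines of arithmetic. The sketch below assumes that equation (54) has the usual form D = D₀ exp(−H/RT), which is not reproduced in this section, and takes the lower temperature limit as −35°C. The function names are hypothetical. The second function illustrates the final remark of the paragraph: with D₀ known, one measured value of D suffices to give H.

```python
import math

R = 1.987  # gas constant, cal/(mole K)

def diffusion_coefficient(d0, h, temp_k):
    """Assumed form of equation (54): D = D0*exp(-H/RT)."""
    return d0 * math.exp(-h / (R * temp_k))

def heat_of_activation(d0, d, temp_k):
    """Invert the same relation for H, given a single measured D and a known D0."""
    return R * temp_k * math.log(d0 / d)

D0 = 0.02     # cm^2/sec, carbon in alpha-iron (value quoted in the text)
H = 20100.0   # cal/mole (value quoted in the text)

t_low = 273.15 - 35.0    # K, lower limit assumed to be -35 C
t_high = 273.15 + 800.0  # K

d_low = diffusion_coefficient(D0, H, t_low)
d_high = diffusion_coefficient(D0, H, t_high)
decades = math.log10(d_high / d_low)   # powers of ten covered by the plot

h_back = heat_of_activation(D0, d_high, t_high)
```

Under these assumptions the range of D comes out at a little over fourteen powers of ten, in agreement with the statement above.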

Examples of Random Substitutional Solutions 

A substitutional solid solution that shows no order under zero stress 
may also show anelastic effects resulting from stress-induced ordering. 
Such effects have only been observed at relatively high concentrations 
of solute (when the solute is present to ten or more atomic per cent), 
in contrast to the interstitial case where minute amounts of solute 
produce a sizable internal friction peak. The first observation of an 
internal friction peak due to stress-induced ordering in a substitutional 
solution was that of ZENER in the case of a single crystal of α-brass 
(30 per cent Zn). These measurements were only at one frequency and 
composition and the origin of the peak was not explained. Later KÊ 80 
verified the existence of such a peak in (70-30) α-brass at lower 
frequencies, and obtained a heat of activation in reasonable agreement 
with the heat of activation for diffusion of zinc in brass. A possible 
mechanism for this relaxation effect in brass was suggested by ZENER 81 
and will be discussed below. The investigation of this type of 
anelasticity was extended by the writer 82 to α-silver-zinc alloys (silver rich), 
where similar, but larger, effects were anticipated and found (due to the 
greater difference in atomic radii of Ag and Zn as compared to Cu and 
Zn). Extensive work 83 was carried out on the Ag-Zn system and the 
dependence of internal friction on composition and temperature was 
obtained. The observed internal friction peak is clearly a volume effect, 
since it is independent of grain size. However, if the grain size is too 
fine, the grain boundary internal friction (see the next section) also 
occurs, thus requiring a resolution of the two effects. The most success- 
ful investigations of the ordering internal friction are, therefore, carried 
out on single crystals or coarse-grained polycrystalline specimens. 

Clearly, the explanation for the observations in substitutional alloys 
cannot be the same as for interstitial alloys. For a lattice that has 
cubic symmetry, any elastic distortion due to the substitution of a 
larger or smaller atom at one lattice site must also have cubic symmetry. 
There are, therefore, no preferential sites for such atoms when stress is 
applied. On the other hand, the distortion about a pair of solute atoms 
in nearest neighbour positions no longer has cubic symmetry; a pair of 
solute atoms will produce tetragonal distortion about the direction of the 
pair axis. This loss of cubic symmetry in the case of pairs of atoms is 
the basis for the explanation given by ZENER 81 for the anelastic 
behaviour of substitutional alloys. He suggests that pairs of 
substitutional solute atoms behave completely in analogy to single interstitials. 
For example, if the solute atom is larger than the solvent, then under 
zero stress, pairs of solutes will be randomly distributed over all 
crystallographically permissible directions. If, now, a tensile stress is 
applied in a particular direction, the pair axes will show a preferential 
alignment in those crystallographic directions which are the nearest to 
the stress direction. The attempt of the lattice to establish a state of 
equilibrium at each value of stress will give rise to internal friction in 
exactly the same way as in the case of interstitial solutions. The time 
of relaxation will be closely related to the rate of atomic movement in 
the alloy. A high enough concentration of solute is required, in the 
substitutional case, for the occurrence of pairs of atoms to be reasonably 
probable. This stress-induced preferential orientation of pairs of atoms 
is a form of stress-induced ordering. The type of order involved does not 
fall into the category of either long range or short range order, for it is 
assumed that the pairs themselves are formed by random fluctuations. 
The phenomenon is one of stress-induced ordering nevertheless, since 
the preference for special directions of pair axes is a deviation from 
randomness. 

Although the description of the relaxation phenomena in substitu- 
tional solutions in terms of reorientation of solute atom pairs has the 
advantage of its direct analogy to the case of interstitial solutions, it 
cannot be taken too seriously for several reasons. The "pair" concept 
is probably most nearly satisfactory at low solute concentrations, where 
most solute atoms are far apart and the number of pairs is quite small. 
Even in such a dilute solution, where the internal friction effects are 
probably too small to be observed, it is not correct to think of a fixed 
number of pairs of solute atoms, since a tendency toward preferential 
creation or dissolution of nearest neighbour pairs may occur under stress. 
In actual measurements, solutions containing as high as one-third solute 
atoms have been used ; here clusters of solute atoms considerably more 
complex than pairs occur with great frequency. The behaviour of such 
clusters under stress cannot readily be taken into account in terms of a 
theory of reorientation of solute pairs. In fact, the very concept of 
solute and solvent becomes ambiguous as the concentrations approach 
50 per cent! 

If the pair-reorientation picture is an oversimplification, and, there- 
fore, not an acceptable explanation for the observed behaviour of 
substitutional solutions, can we still be sure that the phenomenon is one 
of stress-induced ordering, controlled by volume diffusion within the 
alloy? There is excellent evidence that leads to an affirmative answer 
to this question. In the first place, these effects are observed in well- 
annealed single crystals and are not structure sensitive. In fact, the 
observations are quantitatively reproducible in different specimens of 
the same alloy. The possibility that the observed phenomena are 
related to the presence of internal interfaces or of dislocations is, 
therefore, discounted. Secondly, these anelastic effects are only 
observed in alloys, and are, apparently, larger the greater the atomic 
size difference of the solute and solvent atoms. Finally, the width of 
the internal friction peak corresponds very closely to a single time of 
relaxation, and the heat of activation is of the correct magnitude for 
volume diffusion. In view of this evidence, it seems almost certain that 
the relaxation is controlled by the movement of atoms in the lattice, 
which acts to produce an ordered condition of the alloy under an applied 
stress. It is, therefore, expected that the measured time of relaxation τ 
and the appropriate jump frequency Γ are related by equation (52). 
The dimensionless constant a cannot be calculated, as it was in the 
interstitial case, until we know more precisely the exact nature of the 
ordering process. A reasonable value for a can be estimated, 83 however. 
In the interpretation of equation (52), it is of great importance to 
specify what is the appropriate jump frequency Γ. The interpretation 
of this quantity is not as obvious in a substitutional alloy as in the 
interstitial case ; in a substitutional solution there are two significant 
rates of atom movement, those corresponding to each of the two types of 
atoms, say, A and B. If the mechanism of diffusion is not a direct 
interchange of lattice positions, these two rates, Γ_A and Γ_B, may differ 
significantly. The well-known experiments of KIRKENDALL 84 and of 
DA SILVA and MEHL 85 give evidence that the two atomic species 
actually do not move at the same rate, at least in several f.c.c. alloys. 
It is, therefore, probable that diffusion occurs by means of the move- 
ment of lattice vacancies. 86 In contrast to conventional diffusion 
measurements where the faster moving atoms make the major contribution 
to the measured diffusion coefficient, 87 the stress-induced 
ordering process is controlled by the rate of movement of the slower 
atomic species. This conclusion may be demonstrated as follows: 
let us suppose that B atoms move much faster than A atoms, or 
Γ_B ≫ Γ_A. At a reasonably high concentration of B, it becomes very 
improbable that a given B atom does not have at least one nearest 
neighbour that is also B. The B atoms, therefore, form a continuous 
network throughout the lattice. If diffusion is by means of vacancies, 
most of the movement of vacancies will be along this network of B 
atoms, because the rate of jump of a B into a vacancy is considerably 
greater than that of an A atom. No redistribution of atoms, or ordering, 
can be produced by such movement of vacancies along the B network, 
since for every B atom that has jumped another simply takes its place. 
Only on those relatively rare occasions when an A atom makes a jump 
will a contribution toward re-arrangement of the atom distribution in the 
lattice occur. Thus Γ_A controls the rate of relaxation, and this quantity 
is the appropriate jump frequency to be inserted in equation (52). The 
interpretation of the internal friction effects in terms of the pair 
reorientation of solute atoms, discussed above, leads one to emphasize 
the importance of the movement of solute atoms, 88 which may not 
actually be the rate controlling factor at all. 
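
The first step of the network argument, namely that an isolated B atom becomes improbable at high concentration, can be put into numbers. The sketch below assumes a perfectly random solution on a lattice with z = 12 nearest neighbours (the f.c.c. value, an assumption made here for illustration). It checks only the chance that a B atom has a B neighbour and does not prove that the B atoms form a connected network. The function name is hypothetical.

```python
def prob_has_like_neighbour(c, z=12):
    """Chance that a given B atom has at least one B among its z nearest
    neighbours, for a random solution with atomic fraction c of B."""
    return 1.0 - (1.0 - c) ** z

# Atomic fractions chosen for illustration; the last two match the most
# dilute and the richest silver-zinc alloys mentioned in this section.
concentrations = [0.01, 0.05, 0.158, 0.302]
probabilities = [prob_has_like_neighbour(c) for c in concentrations]
```

On these assumptions the probability is small in a solution of one atomic per cent but exceeds 98 per cent at 30 atomic per cent, which is consistent with the need for relatively concentrated alloys.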

An illustration of internal friction data for Ag-Zn alloys is given in 
Fig. 7, which shows a series of internal friction peaks at one frequency 
for different compositions. The figure shows that both the height and 
location of the peaks are very sensitive to chemical composition. The 
magnitude of the maximum damping falls off rapidly as the concentration 
of zinc decreases, due to the smaller degree of ordering that can be 
produced at lower concentrations. The temperature at which internal 
friction is a maximum, or at which the time of relaxation τ = 1/ω, is 
higher the lower the zinc concentration. This result means 
that the controlling rate of atomic movement at any given temperature 
increases with increasing concentration of zinc. In the case of the lowest 
peak, an additional high temperature damping associated with grain 
boundaries becomes significant enough to distort the ordering peak. 
When data similar to those of Fig. 7 are obtained at different frequencies, 
the temperatures at which the peaks occur are correspondingly shifted. 
By carrying out experiments over a wide range of frequencies, the 
relaxation times for the various silver-zinc alloys may be obtained over a wide 
range of temperatures, as shown in Fig. 8. Data at the lowest 
temperatures can only be obtained by the use of frequencies too low to be 
practicable, and are, therefore, most conveniently obtained by static 
methods instead of by vibration. The curves of Fig. 8 represent 
essentially the variation of atomic mobility with temperature. 
Unfortunately, no reliable conventional diffusion data are available for 
extension of these curves to high temperatures. The data obtained 
by internal friction and static relaxation methods cover a sufficiently 
large range, however, and are sufficiently precise, that a deviation from 
a semi-logarithmic relation is quite apparent. A straight line plot for 
log Γ versus T⁻¹ is only to be expected if H is a constant (equation (54)). 
There is no theoretical reason, however, why H may not be dependent 
on T. Conventional diffusion data have never been reliable enough to 
report such a deviation, except when channels for diffusion, other than 
volume diffusion, occur (e.g. diffusion down grain boundaries). The curves 
of Fig. 8 also show a strong concentration dependence of atomic 
mobility. An increase of about 10 per cent in zinc concentration means an 
increase in the controlling rate of atomic movement by a factor of ten. It is 
interesting to try to see if this strong concentration dependence resides 
primarily in the term Γ₀ or in H. Extrapolation of the curves of 
Fig. 8 to T⁻¹ = 0 shows that the values of Γ₀ for the various 
concentrations are very nearly the same and, therefore, that the differences 
in atomic movement as a function of concentration are principally 
determined by differences in the heat of activation H. The intercepts 
Γ₀, and the corresponding values of D₀, may be compared with the 
strain theory of D₀ (discussed earlier in connexion with interstitial 
diffusion), which has also been applied 89 to substitutional solutions. 
The agreement with theory is quite good. In the case of conventional 
diffusion data, the agreement between reported values and theoretical 
values has generally been poor, but this lack of agreement has been 
related 90 to the presence of mechanisms other than volume diffusion 
in many of these experiments. 

Fig. 7. Internal friction data for a series of silver-zinc alloys (frequency 
of vibration 0.8 cycle per second at room temperature). The atomic per cent 
zinc in the various alloys are 30.2, 24.2, 19.3, and 15.8, corresponding to 
the curves A to D respectively 

Fig. 8. The variation of the time of relaxation for 
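
The extrapolation to T⁻¹ = 0 described above amounts to fitting a straight line to log Γ against T⁻¹ and reading off the intercept and slope. The following sketch uses synthetic jump rates for two hypothetical alloys, not the measured silver-zinc data, and assumes H is constant so that the plot is straight. The pre-exponential factor, the heats of activation and the function name are all illustrative assumptions.

```python
import math

R = 1.987  # gas constant, cal/(mole K)

def arrhenius_fit(temps_k, rates):
    """Least-squares fit of ln(rate) = ln(rate0) - H/(R*T); returns (rate0, H)."""
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(g) for g in rates]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean   # value of ln(rate) at 1/T = 0
    return math.exp(intercept), -slope * R

GAMMA0 = 1.0e14                        # sec^-1, illustrative only
temps = [400.0, 450.0, 500.0, 550.0]   # K
h_dilute, h_rich = 36000.0, 33000.0    # cal/mole, illustrative only

rates_dilute = [GAMMA0 * math.exp(-h_dilute / (R * t)) for t in temps]
rates_rich = [GAMMA0 * math.exp(-h_rich / (R * t)) for t in temps]

g0_dilute, fit_h_dilute = arrhenius_fit(temps, rates_dilute)
g0_rich, fit_h_rich = arrhenius_fit(temps, rates_rich)

# Ratio of the two jump rates at 500 K, arising from the difference in H alone.
ratio_at_500 = rates_rich[2] / rates_dilute[2]
```

In this made-up case the two lines share one intercept and differ only in slope, so that a modest difference in H produces a rate ratio of more than ten at the temperature of measurement.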

In addition to the above experiments, the stress-induced ordering 
effect has also been used for experiments fundamental to our 
understanding of the mechanism of diffusion. In terms of the vacancy 
mechanism of diffusion, which has been widely accepted, the rate of 
diffusion or atom movement depends on the product of the concentra- 
tion of vacancies and the rate of jump of atoms into vacancies. Both 
of these factors are strongly temperature dependent, but conventional 
diffusion measurements are incapable of separating the two. On the 
other hand, by means of low temperature relaxation measurements it 
has been possible to detect the presence of a nonequilibrium 
concentration of vacancies in a silver-zinc alloy that had been quenched rapidly 
from a high temperature. 91 The effect of such quenching is to trap some 
of the relatively large number of vacancies present at high temperatures, 
which then produce abnormally rapid atomic movement, and, therefore, 
abnormally short relaxation times, at low temperatures. For example, 
the atomic jump rate at 50°C is increased by a factor of 10⁵ over the 
equilibrium rate in an Ag-Zn wire that had been quenched from 400°C 
into water at room temperature. Such experiments not only give strong 
additional evidence for a vacancy mechanism, but also make it possible 
to study separately the two factors which determine the rate of atomic 
movement. Furthermore, from observations on the rate at which the 
"quenched-in" vacancies disappear, it is hoped that the mechanism by 
which lattice vacancies are created and absorbed will be determined. 
It has been postulated, 92, 93 for example, that dislocations act as sources 
and sinks for vacancies. 
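
The quenching result lends itself to a rough order-of-magnitude estimate. The sketch below assumes that the equilibrium vacancy concentration varies as exp(−E/RT), that the jump rate at the measuring temperature is proportional to the vacancy concentration, and that every vacancy present at the quenching temperature is retained. None of these assumptions is stated in the text, and since only some of the vacancies are trapped, the result is at best a lower bound to the energy of formation E. The function name is hypothetical.

```python
import math

R = 1.987  # gas constant, cal/(mole K)

def implied_formation_energy(enhancement, t_quench_k, t_measure_k):
    """Energy E (cal/mole) for which exp(-E/RT) changes by the factor
    `enhancement` between the measuring and the quenching temperature."""
    return R * math.log(enhancement) / (1.0 / t_measure_k - 1.0 / t_quench_k)

# Values quoted in the text: a factor of 10^5 at 50 C after a quench from 400 C.
e_lower_bound = implied_formation_energy(1.0e5, 273.15 + 400.0, 273.15 + 50.0)
```

The estimate separates, in a crude way, the concentration factor from the jump-rate factor, which is the kind of separation that conventional diffusion measurements cannot provide.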

Examples of Initially Ordered Substitutional Solutions 

If a solid solution shows an incomplete state of long range order under 
zero stress, then, in accordance with the theory presented in the 
beginning of this section, a change in the state of order will be produced 
by an applied stress if the ordering process is accompanied by a change 
of lattice dimensions. This phenomenon of stress-induced change in the 
degree of order, first predicted by GORSKY, 94 leads to mechanical 
behaviour corresponding to that of the standard linear solid. Unlike 
the two cases already considered, for which the solid solution is 
disordered under zero stress, the characteristic internal friction peak in the 
present case is observable only as a function of frequency (at constant 
temperature), and not as a function of temperature, at constant fre- 
quency. The reason for this difference is that, for the initially ordered 
solution, the degree of order at zero stress is itself a function of 
temperature. Thus, experiments which simply determine internal friction as a 
function of temperature are difficult to interpret. Such experiments, 
carried out by KÖSTER 95-98 and collaborators on several different 
ordering alloys, show that internal friction increases to very high values 
in the vicinity of the Curie point for long-range order, and continues to 
rise, as the temperature is increased into the disordered region, until 
values are obtained which are outside the range that the apparatus can 
measure (about φ = 0.1). In addition to the fact that the degree of 
order at zero stress depends upon temperature, it has also been shown 5 
that in the immediate proximity of the Curie point Δ_M becomes very 
large, since in this range the ordering energy is a very rapidly varying 
function of the degree of order. It is, therefore, difficult to draw 
quantitative conclusions from the type of measurements obtained by KÖSTER, 
unless supplemented by other experiments. 

In the case of CuAu, whose ordered structure is tetragonal (c/a 1-1), 
striking relaxation effects are to be expected, due to stress-induced 
changes in the degree of long-range order. The existence of such effects 
was predicted 94 and confirmed 99 by GORSKY, who carried out elastic 
after-effect measurements at constant temperature. (These experiments 
are analogous to measurements of internal friction as a function of 
frequency of vibration. 5) 

INTERNAL FRICTION ORIGINATING IN INTERFACES 
Stress Relaxation Across Incoherent Interfaces 

Internal interfaces in a metal provide important sources of internal 
friction. A widely studied source of damping is the relaxation of shear 
stress across incoherent interfaces, such as large angle grain boundaries 
and interfaces between different phases. The grain boundary case has 
already been reviewed in this series ; 100 the discussion here will, there- 
fore, be brief and will mainly concern itself with the question of the 
distribution of interfaces. This point of view was recently reviewed by 
the writer. 101 

The concept of stress relaxation across an interface is readily des- 
cribed if we imagine a single interface of limited area located in a metal, 
a hypothetical case, to be sure. The application of a shear stress to such 
a specimen produces a state of uniform stress throughout the material, 
but this state will not remain indefinitely. Because of the inability of an 
incoherent interface to sustain a shear stress across it, stress relaxation 
will take place by a process of slipping across the interface until the 
stress there has gone to zero. Shear stress, therefore, becomes redistri- 
buted so that it is concentrated along the edges of the interface. This 
relaxed state is indicated schematically in Fig. 9, where lines of force are 
used to represent the stress field, such that the number of lines cutting 
across a unit area is proportional to the shear stress. Because of the 
increased stress at the ends, the over-all shear strain in the material is 
increased as a result of this relaxation process. When stress is released, 
the existence of a reverse stress across the interface results in the 
material being restored to its original state. 

Such a material, therefore, behaves like a standard linear solid, where 
the parameter p, which is now the difference between the mean stress 
and that across the interface, approaches its equilibrium value according 
to a relaxation equation (equation (35)). It is readily shown 101 that the 
time of relaxation τ is proportional to the linear dimension of the 
interface, but the temperature dependence of τ (i.e. the activation energy H) 
is independent of the size of the interface if the unit mechanism of the 
process is everywhere the same; Fig. 9 shows that, in the relaxed 
condition, the shear stress undergoes a significant decrease in an 
approximately spherical region whose cross-section is outlined by the 
dashed circle. This region is called the sphere of relaxation. 

Fig. 9. Schematic representation of an incoherent interface across which 
relaxation of shear stress has occurred. The shear stress at any point is 
the number of lines per unit area. The cross section of the sphere of 
relaxation is represented by the dashed circle 

In a well-annealed polycrystalline metal there is a regular collection 
of interfaces similar to the one just described, in which relaxation, or 
so-called "viscous slip," is held up at the edges and corners where the 
various boundaries meet. The spheres of relaxation will not overlap 
significantly, which means that each interface responds to stress 
independently of the others. The mechanical behaviour of the material 
is, therefore, a simple superposition of the contributions due to the 
individual interfaces. Since the linear dimensions of all interfaces are 
not the same, the internal friction peak is usually considerably broader 
(by a factor of 2 or 3) than the peak which corresponds to a single time 
of relaxation. This is the well-known "grain boundary peak" that has 
been studied by KÊ 33, 102 and reviewed in the article by KING and 
CHALMERS. 100 The shift of this peak with frequency gives the heat of 
activation H for the unit mechanism responsible for the relaxation. 
A very large Δ_M effect accompanies the grain boundary relaxation; the 
apparent modulus at low frequencies may be 30 per cent lower than the 
equivalent value for a single crystal. 
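
The broadening produced by a spread of interface sizes can be illustrated by superposing single-relaxation peaks. The sketch below assumes that each interface contributes a peak of the standard form ωτ/(1 + ω²τ²), that τ is proportional to the linear dimension of the interface (as stated above), and that the contributions simply add. The particular set of relative sizes, spanning two decades with equal weights, is hypothetical, as are the function names.

```python
import math

def debye(x):
    """Shape of a single-relaxation internal friction peak, x = omega*tau."""
    return x / (1.0 + x * x)

def superposed(x, multipliers):
    """Equal-weight superposition of peaks whose relaxation times are the
    given multiples of a reference relaxation time."""
    return sum(debye(x * m) for m in multipliers) / len(multipliers)

def half_width_decades(multipliers, step=0.01, span=5.0):
    """Width of the peak at half its maximum height, in decades of omega*tau,
    found by scanning a logarithmic grid."""
    n = int(round(2.0 * span / step))
    logs = [-span + i * step for i in range(n + 1)]
    vals = [superposed(10.0 ** lx, multipliers) for lx in logs]
    half = max(vals) / 2.0
    above = [lx for lx, v in zip(logs, vals) if v >= half]
    return above[-1] - above[0]

single = half_width_decades([1.0])
# Hypothetical relative interface dimensions, evenly spaced in the logarithm.
sizes = [10.0 ** e for e in (-1.0, -0.5, 0.0, 0.5, 1.0)]
spread = half_width_decades(sizes)
broadening = spread / single
```

A plot against the logarithm of frequency at constant temperature is considered here; because τ varies exponentially with 1/T, the same broadening appears when the peak is traced as a function of temperature.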

On the high temperature side of the grain boundary peak, the 
internal friction falls at first, but then begins to rise again. This effect 
has been attributed 28, 29 to slipping at the interface corners where the 
stress is concentrated after relaxation; the same phenomenon results 
in a permanent set when a static stress is applied. These effects were 
previously interpreted differently by KÊ. 103 

Internal interfaces which are not allowed to grow at high temperatures 
to form a regular array, like the grain boundaries of a well-annealed 
polycrystalline metal, may show more complicated effects. 
If the array of interfaces is irregular, or somewhat haphazard, so that 
large overlapping of the spheres of relaxation occurs, the behaviour of the 
collection of interfaces can no longer be regarded as a simple super- 
position of the behaviour of the individual interfaces, but instead we 
obtain an example of coupled relaxations (page 21). In this case, when 
one interface relaxes it produces a significant increase in the shear 
stress across a second, and vice versa, so that very large non-elastic 
strains may be produced. Such a description was used to explain the 
large internal friction obtained after discontinuous precipitation in an 
aluminium-zinc alloy. 25 Instead of the usual broadened peak, the 
internal friction seems to climb to exceedingly high values at high 
temperatures, or, if temperature is held constant, at low frequencies. 
Such behaviour could still be described, phenomenologically, in terms of 
a superposition of relaxation times, but the range of times required 
would be very large and the values would have little physical significance. 
The fact that highly cold-worked materials show this type of 
behaviour, and the interpretation of this fact, will be discussed in the 
next section. 
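
What is meant by a superposition of relaxation times may be illustrated by a minimal numerical sketch. It assumes that each relaxation contributes a peak of the standard single-relaxation form, proportional to ωτ/(1 + ω²τ²), and that the contributions are averaged with equal weight; the equal weighting and the particular sets of relaxation times are hypothetical choices made only to show how the peak is lowered and broadened as the range of times is widened.

```python
def debye_peak(omega_tau):
    """Normalised single-relaxation peak: omega*tau / (1 + (omega*tau)**2)."""
    return omega_tau / (1.0 + omega_tau * omega_tau)


def superposed_friction(log10_omega, log10_taus):
    """Equal-weight average of single-relaxation peaks over a set of times."""
    omega = 10.0 ** log10_omega
    total = sum(debye_peak(omega * 10.0 ** lt) for lt in log10_taus)
    return total / len(log10_taus)


def half_width_decades(log10_taus, lo=-8.0, hi=8.0, steps=3201):
    """Full width at half maximum of the peak, in decades of frequency."""
    grid = [lo + (hi - lo) * i / (steps - 1) for i in range(steps)]
    vals = [superposed_friction(x, log10_taus) for x in grid]
    half = 0.5 * max(vals)
    above = [x for x, v in zip(grid, vals) if v >= half]
    return above[-1] - above[0]
```

With a single relaxation time, `half_width_decades([0.0])` gives a width of a little over one decade of frequency; spreading the times over two decades, as in `half_width_decades([-1.0, -0.5, 0.0, 0.5, 1.0])`, gives a wider and lower peak.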

Although the general descriptive features of stress relaxation across 
incoherent interfaces may be given without knowledge of the atomic 
mechanism responsible for this behaviour, the explanation of the 
kinetics of the process must be closely tied in with the atomic mechanism. 
Thus, much attention has been given to values for the heat of activation 
for grain boundary stress relaxation. On the basis of very little evidence 
available at the time, KE 104 suggested that the heat of activation for 
grain boundary relaxation may be the same as that for volume diffusion. 
This correlation is a surprising one, inasmuch as there seems to be no 
theoretical reason for it. Subsequent experimental evidence 28, 101, 105 
has shown values of the activation energy for grain boundary relaxation 
both greater and smaller than that for volume diffusion. Apparently, 
the suspected correlation is not correct, but the mechanism of stress 
relaxation is still not understood. A mechanism suggested by MOTT, 106 
and discussed at length in the earlier article, 100 is disputed on the basis 
of evidence presented by the writer. 101 The effect of minute amounts of 
impurities on the activation energy for stress relaxation is not clear, 
as yet, although it is known that impurities may have a large effect on 
both the position and height of the grain boundary internal friction 
peak. 44, 107 Further investigations of the heat of activation for stress 
relaxation across grain boundaries will most certainly contribute to our 
understanding of the nature of the grain boundary. 

Movement of Twin Interfaces 

When transformation from a cubic to a tetragonal structure occurs, 
heavy twinning is necessary in order to avoid high residual stresses in 
the material. Such twinned structures have been observed in a Mn-Cu 
(12 per cent Cu) alloy 108 as well as in In-Tl alloys. 109 An internal friction 
peak with maximum at about 0°C was observed by WORRELL for the 
Mn-Cu alloy. Inasmuch as crystallographic coherency exists across a 
twin interface, the explanation for the observed internal friction cannot 
be the same as for the viscous-like slipping of incoherent interfaces. 
The tetragonality of the lattice structure suggests that the application 
of shear stress will induce movement of a twin interface normal to 
itself, in such a way as to enlarge the region where the direction of 
tetragonality is favourable with respect to the applied stress relative to 
that in which it is unfavourable. The fact that an internal friction peak 
is observed leads to the belief 5 that this movement of twin boundaries 
is a relaxation phenomenon. Further measurements have been made 110 
for the Mn-Cu alloy, but internal friction has not yet been studied as a 
function of frequency. There is, therefore, still some uncertainty as to 
whether the internal friction actually originates in a relaxation process ; 
if it does, the significance of the time of relaxation must be ascertained. 

INTERNAL FRICTION RELATED TO PLASTIC PHENOMENA 

The type of internal friction to be discussed here is related to the 
internal stresses, or dislocations, introduced by cold working and is 
commonly referred to as "cold work internal friction." The latter 
designation is not accurate inasmuch as effects are also observed due 
to the residual dislocations in well annealed materials. 

In contrast to our relatively recent awareness of the other sources of 
internal friction discussed in this article, the study of damping of 
plastic origin has a long history. As pointed out by GUILLET, 111 even 
the earliest bell makers were cognizant of the fact that brittle materials 
gave the lowest damping. In recent years, considerable effort has been 
directed toward the study of this cold work internal friction. Yet, in 
spite of the effort expended, our understanding of the nature of this 
type of damping is in a very elementary stage. The reasons for such 
slow progress are many. The effects are very complex and a consider- 
able number of variables are involved, among them, the degree of cold 
work and the type of cold work given the specimen material, the time 
at room temperature before measurement, the frequency of vibration, 
and the impurity content of the specimen. Much of the work in the field 
is not fundamental in character but is concerned only with the measure- 
ment of internal friction, at one temperature and frequency, as a 
function of some alloying element or a particular heat treatment. Often 
no attempt is made to see if other sources of internal friction may also be 
present to complicate the results. As a consequence, it is usually difficult 
to compare data by different investigators, which often contradict 
each other. 

On the brighter side, there have been a sufficient number of funda- 
mental investigations to give us a collection of important facts. These 
basic facts will first be reviewed without any attempt to interpret them. 
Finally, a discussion in terms of dislocation theory will attempt to see 
to what extent the factual material can be explained. 

Effect of Cold Working 

The room temperature internal friction of a well annealed material 
usually has a very low value, of the order 10⁻⁵, although this value is 
apparently very sensitive to the purity of the material. The structure 
sensitivity of this damping, especially of single crystals, is most striking ; 
various investigators 9, 112, 113 report that slight jarring or almost any 
handling operation will produce a large increase in damping. In addition, 
two single crystals prepared of the same material and in exactly 
the same way differ in their internal friction values, sometimes by as 
much as a factor of ten. 114, 115 Similarly, different parts of one crystal 
do not show the same internal friction. 12, 116 

The application of a small static stress to a well annealed material 
will usually raise its internal friction appreciably, even though no 
observable yielding has occurred. The duration of loading seems to be 
important since it seems possible 113 to produce a larger damping by 
dropping or jarring a specimen, than by equivalent static loads. This 
point bears further investigation, however. Systematic investigation 
of the effect of cold working on damping has been carried out using 
various systems of stressing : application of compressive loads, pulling 
in a tensile machine, drawing through dies. One cannot be certain that 
the same types of internal stresses are always produced by these various 
methods and, therefore, that the data obtained are always comparable. 
Several authors show that the room temperature internal friction as a 
function of cold work goes through a maximum. ZENER, CLARKE and 
SMITH 50 report such a maximum in polycrystalline brass at about 5 per 
cent elongation, for measurements at a frequency of 2000 c.p.s. The 
observed damping, even at the maximum, is very small, but this may 
be related, as we shall see, to the fact that about two weeks elapsed 
between the cold working and the measurements. LAWSON IIS reports 
very striking maxima for spectroscopically pure polycrystalline copper, 
measured at 50 kc/sec. Correspondingly, a minimum is observed in the 
dynamic Young's modulus of the specimens (ΔE/E about 6 or 7 per 
cent) at the same amount of cold work that produces the maximum 
damping. The height of the maximum in the room temperature damp- 
ing is greater the lower the temperature at which the deformation is 
carried out. HASIGUTI and HIRAI 115 obtain similar maxima for copper 
single crystals, measured at frequencies in the tens of kilocycles. The 
maxima are not identical for different specimens, but occur for 1-10 per 
cent elongation and for values of φ × 10³ in the range 3-10. For large 
amounts of cold work the internal friction apparently decreases to a 
constant value greater than the value for the annealed specimen. 
BOULANGER 117 also observes a maximum in the residual damping (after 
room temperature annealing) of a nickel-chrome alloy. Other investi- 
gators report no maximum, but a monotonic increase of damping with 
cold working of aluminium 118, 119 and iron, 120 although the rate of rise of 
the curves is considerably decreased for large amounts of cold work. 
In cases where simultaneous observations of the dynamic modulus are 
made, a decrease in the modulus is generally observed to parallel an 
increase in the internal friction. This decrease in modulus may be as 
large as several per cent. 121 As we shall see, only part of this decrease 
may be interpreted as a ΔM effect. 

Although the behaviour of pure and impure materials as a function of 
degree of cold work seems to be about the same, qualitatively, the 
magnitude of the changes produced by cold working is greater for the 
purer materials. 118, 119 

Effect of Annealing 

The conventional description of the effect of annealing a cold worked 
metal at successively increasing temperatures is that no significant 
internal changes occur until recrystallization begins, except for the 
relief of "macrostresses" responsible for stress-corrosion cracking. 122 
This description is based on the fact that resistivity, x-ray line broaden- 
ing, microstructure, hardness, and tensile strength all show essentially 
no change as a result of annealing below the recrystallization tempera- 
ture. (Hardness and tensile strength may even increase slightly for 
low temperature anneals.) It has, therefore, been supposed that the 
internal stresses produced by plastic flow are not relieved before 
recrystallization occurs. Internal friction measurements show surprisingly 
different phenomena. The room temperature damping (usually 
at high frequencies) produced by cold working may recover completely 
to the value it had before cold working, after anneals at temperatures too 
low to produce any changes in the properties mentioned above, as first 
shown by FORSTER and KOSTER 95 for brass and low-carbon steel. The 
prior recovery of internal friction has no influence on the tendency of the 
material to recrystallize at sufficiently high temperatures and, conversely, 
the room temperature damping is little affected by the onset of 
recrystallization, if this damping has already recovered during a low 
temperature anneal. More extensive investigations of the annealing of 
cold worked α-brass are reported by KOSTER and ROSENTHAL. 121 It is 
shown that sizeable recovery of internal friction occurs at room tem- 
perature in tens of hours. Measurements of the damping as a function 
of time show that the rate of recovery is greatest at the shortest times. 
Since measurements in these experiments were started one-half hour 
after cold working, initial values of the damping may have been con- 
siderably higher than any of the reported values. Very rapid measure- 
ments that may be taken immediately after plastic deformation, are 
therefore desirable in annealing studies. The recovery of internal 
friction at room temperature occurs in about the same time for specimens 
given prior extensions of from 10 to 60 per cent, except that the starting 
values are higher for the greater elongation. The work of KOSTER and 
ROSENTHAL indicates that the recovery of room-temperature internal 
friction in cold worked brass takes place in a similar manner at various 
annealing temperatures, the time of recovery decreasing from tens of 
hours at room temperature to minutes or seconds at ^00C. Similar 
measurements on cold worked brass are reported by FUSFELD, 123 who 
shows that the effective heat of activation for the recovery process is 
substantially lower than 10,000 calories per mole. Observations of the 
effect of annealing on internal friction in aluminium 119 and iron 120 are 
reported by KOSTER. The results for aluminium seem to stand alone in 
that they show incomplete recovery below the recrystallization temperature. 
The case of α-iron is very striking; here the recrystallization 
temperature is at 500-600°C, yet damping recovers completely at room 
temperature in ten hours, and at 100°C in minutes. Similar effects are 
also observed in copper. 115, 42 

The ability of cold worked materials to anneal at room temperature, 
as far as internal friction measurements are concerned, may, in part, 
explain some of the discrepancies in the effect of cold work on internal 
friction as reported by various investigators. The results of Bou- 
LANGER 117 show, for example, that whether or not a maximum is 
obtained in the curve of damping versus degree of cold work, depends 
on the time that has elapsed between the cold working and the measure- 
ments. 

The distinct difference between the effect of annealing on internal 
friction and on most other properties leads to the almost unavoidable 
conclusion that there are at least two distinct types of internal stress : 
one responsible for hardness and gross plastic deformation, which is 
required in order that recrystallization may occur, and the second 
responsible for internal friction effects. 


The dynamic modulus is apparently somewhere between hardness 
and internal friction in its reaction toward annealing. A small part of 
the large decrease in modulus, produced by large amounts of cold work, 
recovers together with the internal friction; the remainder is not 
affected by anneals below the recrystallization temperature. This 
residual effect is clearly observed, 121 either after annealing for a long 
time at room temperature, or after successive anneals at increasing 
temperatures have completely eliminated the cold work internal fric- 
tion. The fact that the recoverable part of the decrease in dynamic 
modulus is eliminated in the same time as the internal friction, is strong 
evidence that it represents a ΔM effect, i.e. it results from an additional 
in-phase component of strain produced by the same non-elastic 
mechanism as that responsible for the damping. The part of the 
decrease in dynamic modulus which is not recoverable below the 
recrystallization temperature is apparently a lowering of the modulus 
M' (equation (9)) that relates the stress to the elastic strain. That the 
presence of internal strains will produce such a lowering of the elastic 
modulus, was demonstrated by ZENER. 124 One should also keep in 
mind the fact that large amounts of cold work may also change 
the modulus by producing preferential orientation (texture) in a 
polycrystalline material. 

Strongly Amplitude Dependent Damping 

Although amplitude dependence of internal friction is observed 125, 126 
for vibration at large strain amplitudes, it had been believed 1 that, so 
long as the maximum strain amplitude in a specimen is kept below 
10⁻⁵ or 10⁻⁶, damping is independent of amplitude. That this belief is 
not always correct is strikingly demonstrated by the experiments of 
READ, 10 which show that the internal friction of annealed and slightly 
cold worked single crystals of zinc and copper is strongly amplitude 
dependent, even at strain amplitudes of 10⁻⁷; for the case of zinc the 
damping may increase by almost one hundred times from low amplitudes 
up to 10⁻⁶. These experiments, which use the piezoelectric 
method, also show that, associated with the amplitude dependent 
damping, there is an amplitude dependent ΔM effect, as measured by 
the decrease in resonant frequency with increasing amplitude of 
vibration. The resonant frequency in these experiments is in the tens of 
kilocycles. It is shown that the curve of damping versus strain ampli- 
tude for zinc crystals is not reversible, i.e. different values are obtained 
for decreasing amplitudes than those for increasing amplitudes. These 
results are interpreted as evidence that the measurements themselves 
produce internal changes in the crystals. Further experiments on these 
effects in zinc crystals are reported by several investigators. 127, 114, 49 
The experiments on copper crystals show considerably simpler effects, 
probably because room temperature represents a much lower "reduced 
temperature" for copper than for zinc. (Copper has a much higher 
melting temperature and recrystallization temperature than zinc.) The 
damping-amplitude curves for copper crystals are not affected by the 
measurements themselves and, therefore, represent a measure of the 
mechanical state of the material. Further experiments on copper 
crystals are those of MARX and KOEHLER 21 and of the writer. 11, 12 
Fig. 10, taken from reference 12, shows the effect of successive 
applications of a static compressive stress on the curves of internal 
friction versus strain amplitude, for a copper crystal. The stresses 
were sufficiently small so that no observable change of length of the 
crystal occurred as a result of stressing. Similar data were also 
given by READ. 10 It should be noted that these curves start in a 
parabolic fashion, so that the damping below a strain amplitude of 
10⁻⁷ is practically independent of amplitude. One could then ignore 
the amplitude dependent part of the damping and concentrate only on 
the internal friction at very low amplitudes. On the other hand, the 
strong amplitude dependence seems to be the most characteristic 
feature of this type of damping. Furthermore, it is possible to measure 
very precisely the changes in resonant frequency with strain amplitude, 
so that the amplitude dependence of both the ΔM effect and the damping 
may be obtained simultaneously. (Changes in resonant frequency 
with amplitude can be measured to better than 1 part in 50,000.) For 
these reasons, the work of the writer concentrates on a quantitative study 
of the amplitude dependent internal friction and ΔM effect. The experimental 
results show that the ratio of the amplitude dependent part of 
damping to that of the ΔM effect is independent of frequency and of the 
temperature; this fact could not be explained in terms of any mechanism 
of viscoelastic behaviour. On the other hand, the experimental 
values of this ratio are in quantitative agreement with the predictions 
of the theory of static hysteresis as given by equations (49). The 
exceptional amplitude dependence of this type of internal friction might, 
in itself, have led us immediately to suspect a mechanism of static 
hysteresis. This conclusion is suggested by the discussion (in the earlier 
section on static hysteresis) which shows that such a mechanism 
inherently leads to strong amplitude dependence because of the non-linearity 
of the static hysteresis loops, in contrast to viscoelasticity, 
which is essentially a linear phenomenon at low amplitudes. 

Fig. 10. Effect of successive applications of static compressive stress on the amplitude-dependent internal friction in a single crystal of copper 
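
The connection between a measured change in resonant frequency and the corresponding change in dynamic modulus may be sketched as follows. The sketch assumes only that the resonant frequency of the specimen varies as the square root of the modulus, the dimensions and density being unchanged; the function names are illustrative and the frequencies in the usage note are hypothetical.

```python
def modulus_change_from_frequency(f_ref_hz, f_hz):
    """Fractional change in dynamic modulus, (M - M_ref) / M_ref.

    Assumes resonant frequency proportional to sqrt(M / density) with fixed
    specimen dimensions and density, so that M / M_ref = (f / f_ref) ** 2.
    """
    ratio = f_hz / f_ref_hz
    return ratio * ratio - 1.0


def modulus_resolution(frequency_resolution):
    """Smallest resolvable fractional modulus change, to first order.

    Differentiating M proportional to f**2 gives dM / M = 2 * df / f.
    """
    return 2.0 * frequency_resolution
```

On this assumption a frequency resolution of 1 part in 50,000 corresponds to `modulus_resolution(1 / 50000)`, that is, a modulus change of 1 part in 25,000; a drop from a hypothetical 50,000 c.p.s. to 49,999 c.p.s. gives a negative result from `modulus_change_from_frequency`, i.e. a lowered modulus.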

The reader may wonder why the phenomenological mechanism might 
not have been decided with certainty by a simple study of the frequency 
dependence of internal friction. Such measurements do not provide 
decisive results in the present case, because of the extreme structure 
sensitivity of the property we are measuring. As long as one observes 
that two crystals, grown and treated identically, or that the two halves 
of one crystal, 116 differ considerably in values of internal friction, it 
becomes difficult to measure any frequency effects that are not masked 
by such variations. Even measurements at a series of natural fre- 
quencies (overtones) of one given specimen do not represent the 
properties of the same part of the lattice, since the maximum stress 
amplitude occurs in different parts for different harmonics. Observed 
variations with frequency are, therefore, different for different speci- 
mens, but the scatter seems to fall about a frequency independence in 
the case of the copper single crystals. 11 For more highly cold worked 
polycrystalline materials, greater homogeneity seems to exist, and 
internal friction is reported 50 to be independent of frequency. 

The temperature dependence of the curves of internal friction versus 
amplitude has also been studied for single crystals of copper in a limited 
temperature range. 11 The internal friction and ΔM effect increase with 
temperature in this range of measurement, for which a tentative 
explanation has been advanced. 128 Other observers report 129, 13 
different results for the temperature dependence. Further experimenta- 
tion along these lines will undoubtedly be fruitful. 

Surface conditions seem to have an effect on internal friction, but it is 
reported 127 ' 49 n that etching the surface results in a lowering of the 
internal friction curve as a whole without affecting the amplitude 
dependent part. 

The demonstration that amplitude dependent damping in single 
crystals at high frequencies originates in a mechanism of static hysteresis, 
and the close relation between this mechanism and strong amplitude 
dependence, means that it is always desirable to know whether damping 
is or is not strongly amplitude dependent. Unfortunately, many of the 
publications give no clue regarding this matter, either because the 
apparatus used involved weak electromechanical coupling, so that all 
strain amplitudes are below 10⁻⁷ where no amplitude dependence is 
observed, or because the experimenter did not check on the possibility 
of amplitude dependence. 

Study of a Heavily Cold Worked Material 

One investigation by KE IIS and ZENER ISI deserves special attention, 
since it can be interpreted in terms of a mechanism which differs greatly 
from static hysteresis. The experiments differ in two ways from the 
single crystal investigations. First, the measurements are at low 
frequencies, i.e. obtained by means of a torsion pendulum. Second, the 
specimens (pure and commercial aluminium) are severely cold worked, 
by drawing them through dies to a 95 per cent reduction in area. The 
internal friction of these heavily cold worked specimens is independent 
of amplitude up to strain amplitudes of 10⁻⁵. It originates in anelastic 
behaviour, since creep produced by static stress is completely recoverable; 
furthermore, it varies with frequency and temperature as a 
function of the product ω exp(H/RT), with H equal to 31,000 cal/mole. The 
curve of internal friction versus frequency does not show a maximum, 
but climbs to very high values at low frequencies. This behaviour is 
completely comparable to that to be expected in the case of coupled 
relaxations of an irregular network of interfaces, discussed in the 
previous section. In the present case the interfaces are slip bands, which 
acquire a viscous behaviour due to the interlocking of the dislocations 
present in high concentrations in these bands. 131 The irregular distribu- 
tion of the bands results in a coupling phenomenon, as discussed earlier. 
The interpretation in terms of a network of slip bands is in agreement 
with the observations of WOOD 132 that a heavily cold worked material 
is fragmented into crystallites. We may regard the slip bands as the 
crystallite boundaries. 
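
The statement that the internal friction depends on frequency and temperature only through the product of angular frequency and exp(H/RT) means that a change of temperature is equivalent to a change of frequency. A minimal sketch of this equivalence follows; it takes H = 31,000 cal/mole from the text, while the function names and the temperatures in the usage note are illustrative assumptions.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K)


def reduced_variable(f_hz, temp_k, h_cal=31000.0):
    """The product omega * exp(H / RT) on which the damping is taken to depend."""
    return 2.0 * math.pi * f_hz * math.exp(h_cal / (R_CAL * temp_k))


def equivalent_frequency(f_hz, temp_k, new_temp_k, h_cal=31000.0):
    """Frequency at new_temp_k giving the same omega * exp(H / RT).

    Equating the product at the two temperatures gives
    f_new = f * exp((H / R) * (1 / T - 1 / T_new)).
    """
    return f_hz * math.exp((h_cal / R_CAL) * (1.0 / temp_k - 1.0 / new_temp_k))
```

For instance, `equivalent_frequency(1.0, 500.0, 550.0)` returns the frequency at a hypothetical 550 K that should show the same damping as 1 c.p.s. at 500 K; it is larger than 1 c.p.s., so that raising the temperature at fixed frequency has the same effect as lowering the frequency at fixed temperature.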

Discussion in Terms of Dislocation Theory 

Most of the important facts concerning the internal friction of plastic 
origin have been discussed, and the reader, reflecting upon them, will 
undoubtedly agree that many careful experiments remain to be per- 
formed before a detailed understanding of the atomic mechanism of this 
damping will be possible. An attempt at interpretation of the measure- 
ments in terms of dislocation theory must, in the present stage, be 
somewhat speculative. On the other hand, new experiments are 
generally the outgrowth of speculation. 

In two investigations progress has been made toward an understanding 
of the mechanism of damping, at least from a phenomenological 
viewpoint. These are the investigations of amplitude dependent 
damping in single crystals of copper at high frequencies, and the 
study of heavily cold worked aluminium at low frequencies. Let us 
start with the latter. Here a satisfactory explanation may be given in 
terms of slip bands or crystallite interfaces. The behaviour of regions 
that are very dense in dislocations is not conveniently discussed in 
terms of dislocation theory, for the same reason that it is not profitable 
to discuss large angle grain boundaries as made up of dislocations : the 
identity of the individual dislocations is lost. The study of heavily cold 
worked material seems to fit best into the category of damping resulting 
from stress relaxation across incoherent interfaces. The fact that the 
observed heat of activation for the damping in heavily cold worked 
aluminium is the same as that for grain boundary relaxation in that 
material, within experimental error, is further evidence that the two 
effects are basically the same. The grain boundaries, of course, repre- 
sent a more regular collection of interfaces than the slip bands of a 
heavily cold worked metal, and, therefore, produce an internal friction 
peak. Annealing at low temperatures of the heavily cold worked material 
may result in lowered damping due to the production of greater regu- 
larity in the crystallite boundaries without appreciable growth of 
crystallites (which would lead to recrystallization). 

Now let us turn to the behaviour of slightly cold worked materials, 
where the properties of individual dislocations may be more readily 
discussed. As already pointed out, we must conclude, from the easy 
room temperature recovery of cold work internal friction, that the type 
of internal stress responsible for internal friction must differ from that 
which produces recrystallization and hardness, in that it is less stable 
or more mobile. Translated into dislocation language we may say that, 
whereas dislocations tightly bound in internal interfaces are responsible 
for gross plastic flow, the internal friction in a material that is not too 
heavily cold worked is associated with relatively free dislocations. We 
may imagine that during low temperature annealing these free disloca- 
tions are not lost from the lattice but simply become bound, either to 
impurity atoms or to other dislocations (presumably in internal surfaces). 
The ease with which recovery takes place, as indicated by its occurrence 
at room temperature and by the fact that the heat of activation for the 
process is small 123 relative to corresponding values for volume diffusion, 
shows that the recovery process does not involve atomic diffusion. If 
recovery involves the locking of dislocations by impurity atoms, it is 
the dislocations that move to the impurity atoms, rather than vice 
versa. The occurrence of a mechanism of static hysteresis in high 
frequency measurements is now readily interpreted. The static hystere- 
sis loop is a consequence of a certain number of relatively free disloca- 
tions being torn loose from the small barriers that hinder their motion, 
for each increment of stress dσ. The stress-strain loop is then really a 
step-like affair, each step corresponding to the burst of a single dislocation, 
which moves almost instantaneously from the original barrier to 
another position where its motion is again interrupted. Each such dis- 
location burst produces non-elastic strain. Once a dislocation is pulled 
loose it will not generally return to its original position unless a reverse 
stress is applied. The magnitude of an individual step that constitutes 
the stress-strain loop is a small fraction of an atomic distance, too 
small to be observed as a step. The stress-strain loop that has been 
described is analogous to the magnetic hysteresis loop, in which sudden 
movements of magnetic domain boundaries are induced by an applied 
magnetic field in the same way as dislocation bursts are induced by an 
applied stress. 

A qualitative estimate of the dependence of internal friction on the 
degree of cold work may be given in terms of the present interpretation. 
The damping is proportional to the energy loss per cycle ΔW (equation 
(27)), and the contribution to this energy loss, in that part of the cycle 
where the stress is increased by an amount dσ, may be represented as 
d(ΔW). This latter quantity is proportional to the increase in non-elastic 
strain corresponding to the stress increment, which in turn is proportional 
to the product of the number of dislocations moved dN and the 
mean distance moved L. In the form of an equation: 

d(ΔW) ∝ L . dN 

It now seems reasonable that, in successive stages of cold working 
starting from a well-annealed material, the major increase is, at first, 
in the term dN, while L remains essentially constant ; more free disloca- 
tions are made available, but the distance that each may move in one 
burst is fixed by imperfections already present in the lattice. As cold 
working progresses, the crowding of dislocations soon begins to place a 
serious limit on L, which begins to decrease rapidly when the density 
of dislocations is large enough so that they begin to hinder each other's 
motion. The quantity dN cannot become indefinitely large to counter- 
act this decrease in L, since the tendency for relatively free dislocations 
to become "bound dislocations" will limit the number of free disloca- 
tions available. This argument predicts a maximum in the product 
L . dN, and, therefore, in the internal friction, as a function of the 
degree of cold work. The concept of dislocation bursts has, therefore, 
provided a reasonable explanation for the observed maxima in the 
curves of damping versus cold work. 
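
The argument may be made concrete with a toy calculation. The functional forms below are purely hypothetical, chosen only to have the qualitative properties described: dN rises with cold work and saturates as free dislocations become bound, while L stays nearly constant at first and then falls as crowding sets in. The units of cold work and all parameter values are arbitrary, and no attempt is made to reproduce measured curves.

```python
def free_dislocation_count(cold_work, n_sat=1.0, k=0.5):
    """dN: grows with cold work and saturates at n_sat (hypothetical form)."""
    return n_sat * cold_work / (cold_work + k)


def mean_burst_distance(cold_work, l0=1.0, crowding=2.0):
    """L: close to l0 at first, then falls as crowding sets in (hypothetical form)."""
    return l0 / (1.0 + (cold_work / crowding) ** 2)


def relative_damping(cold_work):
    """Quantity proportional to L * dN, and hence to the damping."""
    return free_dislocation_count(cold_work) * mean_burst_distance(cold_work)


def cold_work_at_maximum(max_cold_work=20.0, steps=2001):
    """Grid search for the degree of cold work that maximises L * dN."""
    grid = [max_cold_work * i / (steps - 1) for i in range(steps)]
    return max(grid, key=relative_damping)
```

With these assumed forms the product vanishes for the annealed state, falls off again at large cold work, and `cold_work_at_maximum()` returns an intermediate value, which is the qualitative behaviour the argument predicts.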

Amplitude dependent internal friction has only been studied, and its 
mechanism established as one of static hysteresis, at high frequencies. 
No systematic low frequency measurements for well annealed materials 
and for small amounts of cold work have been obtained. (This lack of 
data is partly a result of the fact that low frequency measurements of 
small values of internal friction are difficult, because the equipment 
itself has thus far been responsible for a residual damping of about 10⁻⁴.) 
If the dislocation mechanism just described is correct, however, we 
may anticipate that, given enough time (i.e. at sufficiently low fre- 
quencies) or at higher temperatures of measurement, it should be 
possible for a relatively free dislocation to pull away from its barrier by 
thermal activation, without having to wait for the applied stress to 
become sufficiently large to free it. In other words, the mechanical 
behaviour which is describable in terms of static hysteresis at high 
frequencies may completely change its character to a viscoelastic type 
of mechanism at low frequencies. One model that may be presented is 
that, at low frequencies, instead of dislocations being torn loose from 
impurity atoms that restrain them, they may drag the impurity atoms 
along with them. This behaviour would lead to a viscous creep involving 
a permanent set, since dislocations that have moved have no reason to 
return to their original positions unless a reverse stress is applied. The 
low frequency internal friction may be considerably less amplitude 
dependent than the high frequency damping, and it may show a strong 
frequency dependence, if this interpretation is correct. In fact, the 
simplest description of such behaviour would be the Maxwell solid, 
which represents elasticity plus viscous flow, and leads to an inverse
frequency dependence (equation (12)). Unfortunately there are no low 
frequency damping measurements which can be examined to see if 
these ideas are correct. There are static measurements, however, which 
agree very well with the above interpretation. These are the measure- 
ments of "microcreep" of single crystals of tin, by CHALMERS, 133 which 
show that for very small stresses a limited amount of creep occurs. The 
observed initial creep rate is proportional to the applied stress, and the 
rate of creep, for any given stress, approaches zero with time. This 
microcreep is not recoverable in single crystals. It has been attributed 74, 134
to the movement of relatively free dislocations which drag
impurity atoms along with them. On the basis of this evidence it 
therefore seems logical to suppose that internal friction at high 
frequencies and microcreep may be traced to the same origin: the 
movement of relatively free dislocations which are torn loose by the 
stress when the frequency is high, or which surmount the barrier to their 
movement by an activated process at low frequencies or under static 
conditions. Although this description provides a semblance of order in 
the interpretation of phenomena associated with the movement of 
dislocations at low stress levels and for small degrees of cold work, it 
should be remembered that there is a large unfilled gap between the 
high frequency measurements and the static "microcreep" measure- 
ments. Furthermore, the former has been studied only in the case of 
copper crystals and the latter effect only in the case of tin crystals. 
Evidently, further experiments are required to see if the predictions are
substantiated.
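
The inverse frequency dependence expected of a Maxwell solid can be illustrated with a minimal sketch. It assumes the textbook Maxwell element (a spring in series with a dashpot), for which the loss tangent is $1/\omega\tau$ with $\tau$ equal to the viscosity divided by the modulus. The function names and the sample relaxation time are hypothetical and are not taken from the text.

```python
import math

def maxwell_internal_friction(omega, tau):
    """Loss tangent of a Maxwell solid (spring in series with a dashpot).

    omega: angular frequency (rad/s); tau = viscosity / modulus (s).
    Assumption: ideal Maxwell element, so tan(delta) = 1 / (omega * tau).
    """
    return 1.0 / (omega * tau)

def log_log_slope(f, x1, x2):
    """Slope of log f versus log x between two sample points."""
    return (math.log(f(x2)) - math.log(f(x1))) / (math.log(x2) - math.log(x1))

tau = 10.0  # hypothetical relaxation time in seconds
slope = log_log_slope(lambda w: maxwell_internal_friction(w, tau), 0.01, 1.0)
print(f"log-log slope of damping versus frequency: {slope:.2f}")
```

A slope of -1 on such a plot would distinguish this viscous behaviour from static hysteresis, for which the damping is independent of frequency.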

THERMOELASTIC INTERNAL FRICTION 

The sources of internal friction considered up to this point have all 
involved some form of atomic re-arrangement, induced by stress. In 
the present section, damping originating in stress-induced thermal 
variations will be considered. The basic principles involved in this type 
of internal friction have been understood for some time and have been 
adequately reviewed. 1, 5 The present discussion will briefly summarize
these principles. 

By the thermoelastic effect, we mean the coupling between internal 
(thermal) and mechanical co-ordinates of a material, as illustrated by 
the expansion of a solid when heated, and the reciprocal effect, the 
cooling during an adiabatic expansion. An element of material, to 
which a small stress is suddenly applied, will instantaneously be 
strained by an amount $\varepsilon'$, and this strain will be accompanied by a
change in temperature. If the stress is homogeneous throughout the 
specimen, the same temperature change will occur at every point. On 
the other hand, a non-homogeneous stress sets up a temperature gradient 
in the material, and, therefore, allows heat flow to take place. The flow
of heat into or out of a given element of the material produces an 
additional non-elastic strain. In general, a conversion of mechanical 
energy into heat, or internal friction, will be a result of this heat flow 
and the associated non-elastic behaviour. 

If a periodically varying non-homogeneous stress is applied to the 
material, a periodically varying temperature gradient is produced. 
If the variations in the stress are so rapid, i.e. the frequency so high, 
that insufficient time is allowed for appreciable heat flow to take place 
during a stress cycle, the process is adiabatic and no energy loss, or 
damping, takes place. Under such conditions the dynamic modulus is 
the modulus $M'$, the quotient of the stress by the instantaneous strain
$\varepsilon'$ (equation (9)). The quantity $M'$ is, in this case, known as the
adiabatic modulus. In the other extreme, at sufficiently low frequencies, 
temperature equilibrium in the specimen is constantly maintained, or 
the process is isothermal. The transfer of mechanical energy into heat 
and vice versa takes place reversibly, so that again there is no mecha- 
nical energy lost (or net heat generated) per cycle. The dynamic 
modulus measured under these conditions is called the isothermal 
modulus and is lower than the adiabatic modulus. In the intermediate 
range of frequencies, where the conditions of application of stress are 
neither adiabatic nor isothermal, the conversion of mechanical energy 
into heat is not reversible and internal friction effects may be observed. 
There are two categories of thermoelastic effects, depending on the 
origin of the non-homogeneity of stress in the specimen. 
Macroscopic Thermal Currents

The thermoelastic effect may result from non-homogeneity of stress that 
is inherent in the type of vibration. This phenomenon is demonstrated 
most strikingly for the case of the transverse vibration (flexure) of 
reeds. In this mode of vibration the material that is on the outer or 
convex side at any instant is expanded and, therefore, cooled, while 
that on the concave side is compressed and raised in temperature. Heat 
flow will, therefore, occur across the specimen, if sufficient time is 
allowed. As already pointed out, the thermoelastic internal friction 
shows a maximum as a function of frequency, since very small damping 
occurs at either very low frequencies (the isothermal limit) or at very 
high frequencies (the adiabatic limit). It can be shown 5 that the case 
of a reed in transverse vibration is describable, very nearly, in terms of a 
single time of relaxation (i.e. as a standard linear solid), so that the 
internal friction peak is described by equation (39). The time of 
relaxation $\tau$ is given by

$$\tau = \frac{d^2}{\pi^2 D_t} \qquad (63)$$

where $d$ is the thickness of the reed and $D_t$ the thermal diffusion constant,
which is

$$D_t = \frac{\text{thermal conductivity}}{(\text{specific heat})(\text{density})}$$

The relaxation strength $\Delta_M$ is the fractional difference between the
adiabatic and isothermal moduli; it ranges between $10^{-2}$ and $10^{-4}$ for
various materials. The measurements of BENNEWITZ and RÖTGER 135
and of ZENER, OTIS and NUCKOLLS 136 have verified the theory in all
details.
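
As a rough numerical illustration of where the thermoelastic peak of a reed falls, the sketch below takes Zener's classical result $\tau = d^2/\pi^2 D_t$ and assumes that equation (39) has the usual single-relaxation form, proportional to $\omega\tau/(1+\omega^2\tau^2)$. The property values are approximate room-temperature handbook figures for copper, and the 1 mm thickness and the function names are hypothetical.

```python
import math

def thermal_diffusivity(conductivity, specific_heat, density):
    """D_t = thermal conductivity / (specific heat * density), in m^2/s (SI inputs)."""
    return conductivity / (specific_heat * density)

def reed_relaxation_time(thickness, d_t):
    """Relaxation time for transverse thermal currents, tau = d^2 / (pi^2 * D_t)."""
    return thickness ** 2 / (math.pi ** 2 * d_t)

def debye_peak(omega, tau, strength):
    """Internal friction of a standard linear solid: strength * wt / (1 + wt^2)."""
    wt = omega * tau
    return strength * wt / (1.0 + wt * wt)

# Approximate room-temperature handbook values for copper (assumed, SI units).
d_t = thermal_diffusivity(conductivity=400.0, specific_heat=385.0, density=8960.0)
tau = reed_relaxation_time(thickness=1.0e-3, d_t=d_t)  # 1 mm reed (hypothetical)
f_peak = 1.0 / (2.0 * math.pi * tau)                    # peak where omega * tau = 1
print(f"D_t = {d_t:.3e} m^2/s, tau = {tau:.3e} s, peak near {f_peak:.0f} Hz")
```

Because $\tau$ varies as $d^2$, halving the reed thickness raises the peak frequency fourfold, which is the practical means of moving the peak out of a frequency range of interest.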

Macroscopic thermal currents may also occur, at least in principle, 
in a specimen vibrating in a longitudinal mode, where heat flow is to be 
expected between regions of compression and of dilatation. The distances
between these regions, and, therefore, the time of relaxation
involved, are too large for the effect to be of any significance when
specimens are vibrated in the ordinary range of frequencies. We may, 
therefore, always regard longitudinal vibrations as adiabatic with 
respect to macroscopic thermal currents. 

No thermoelastic effect results from a non-homogeneous shear stress, 
as in torsional vibration, because shearing is not accompanied by a 
change of temperature. 

Microscopic Thermal Currents 

In addition to the possibility that non-homogeneity of stress may 
result from the type of vibration employed, a polycrystalline metal also 
offers the possibility of fluctuations in stress from grain to grain, even 
when the applied stress is macroscopically homogeneous. 137 This microscopic
non-homogeneity is a consequence of the elastic anisotropy of an
individual grain, combined with the fact that grains are randomly
oriented in a polycrystalline metal. Fluctuations in stress from one
grain to another give rise to heat currents between adjacent grains, and,
therefore, to an internal friction peak analogous to that obtained as a
result of macroscopic thermal currents. The peak obtained as a result
of intergranular thermal currents does not represent a single time of
relaxation, however, and the exact form of this curve is not readily
evaluated theoretically. We may expect from dimensional reasoning, 137
that internal friction will be a function of the frequency, the mean grain
size $d_g$, and the thermal diffusion constant $D_t$, only through the product
$\omega d_g^2/D_t$. The maximum in the internal friction may be expected when
this product is close to unity. These predictions are verified by the
experiments of ZENER and RANDALL. 20, 41
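
The dimensional argument gives only an order-of-magnitude estimate of the peak position, but it is enough to show how strongly the peak depends on grain size. The sketch below simply sets $\omega d_g^2/D_t = 1$. The diffusivity is an assumed value of roughly the right size for copper at room temperature, and the grain sizes and the function name are hypothetical.

```python
import math

def intergranular_peak_frequency(grain_size, d_t):
    """Order-of-magnitude frequency (Hz) at which omega * d_g^2 / D_t = 1."""
    omega = d_t / grain_size ** 2
    return omega / (2.0 * math.pi)

d_t = 1.2e-4  # m^2/s, roughly copper at room temperature (assumed value)
for d_g in (1.0e-5, 1.0e-4, 1.0e-3):  # hypothetical mean grain sizes in metres
    f = intergranular_peak_frequency(d_g, d_t)
    print(f"d_g = {d_g:.0e} m -> peak near {f:.3g} Hz")
```

A tenfold increase in grain size lowers the estimated peak frequency a hundredfold, so coarse-grained specimens are the ones most likely to show this damping at ordinary audio frequencies.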

An awareness of the internal friction that originates in thermal 
currents is important when one studies the damping due to other 
sources, if polycrystalline materials are being used, or if the specimen 
is vibrated transversely. Under either of these circumstances, it is 
desirable to work in a frequency range where the vibrations are either 
isothermal or adiabatic with respect to thermal currents, to avoid this 
type of internal friction. 

MAGNETOELASTIC INTERNAL FRICTION

The coupling between magnetic and mechanical properties of a ferro- 
magnetic material provides another important source of internal fric- 
tion. This "magnetic damping" is closely related to other ferromagnetic 
phenomena and has, therefore, been reviewed in some detail in the 
general treatises on ferromagnetism by BECKER and DÖRING 138 and by
BOZORTH. 139 The present discussion is a brief résumé of the factors that
are responsible for internal friction in ferromagnetic materials. 

Magnetoelastic phenomena are best discussed in terms of the domain 
theory of ferromagnetism, according to which a ferromagnetic substance 
is composed of small "domains" that are individually magnetized to 
saturation. In the demagnetized state of a specimen, there is a random 
distribution of the magnetization vectors of the domains among the 
various crystallographic directions of "easy magnetization". The effect 
of a magnetic field is to produce rotation of the directions of magnetiza- 
tion and movements of domain boundaries so as to align the magnetiza- 
tion vectors as closely as possible to the direction of the field. 

The coupling between magnetic and mechanical properties is provided 
by the phenomenon of magnetostriction, which usually manifests itself 
through the elongation of a material in the direction of magnetization. 
(The opposite effect, a decrease in length of a magnetized specimen,
represents "negative magnetostriction".) From the appropriate
reciprocity relation, it follows that application of stress may produce a 
change in the state of magnetization. The effect of stress is, therefore, 
somewhat analogous to that of a magnetic field, in that stress can also 
produce rotation of the magnetization vectors of domains and movement 
of the domain boundaries. The principal difference between the action 
of stress and of a magnetic field is that in a field there is a difference in 
the energy associated with a magnetization parallel and antiparallel to 
the field, while stress is incapable of distinguishing between two 
directions of magnetization 180° apart.

As a result of magnetoelastic coupling, the effect of stress is to produce,
in addition to the elastic strain, a magnetostrictive strain. The
occurrence of this non-elastic strain leads both to internal friction and
to what is called, in the present case, the $\Delta E$ effect. At high fields,
sufficient to align all domains in the direction of the field, the magnetostrictive
strain vanishes. The value of Young's modulus measured
under these conditions, i.e. when the material is magnetized to saturation,
is simply the ratio of stress to elastic strain $\varepsilon'$ and may be designated
in the usual notation (see equation (9)) as $E'$. The difference
between $E'$ and the value of Young's modulus measured at zero
magnetization is the well-known "$\Delta E$ effect." The concept of a $\Delta M$
effect, which appears throughout this article, is a natural generalization
of this $\Delta E$ effect in magnetic materials. We will see shortly that
Young's modulus of a ferromagnetic specimen depends not only on the 
field strength but also on the vibration amplitude. 

The work of several authors 138, 140, 141 has led to the separation of
sources of internal friction in magnetic materials into three types:
macro-eddy currents, micro-eddy currents, and static hysteresis. These 
will now be discussed individually. 

Macro-eddy Currents 

The sudden application of a tensile stress to a partially magnetized 
specimen will produce a change in the magnetization. Such a change 
induces surface eddy currents which produce an additional field inside 
the specimen such that the total flux inside is unchanged. Associated 
with these currents are resistive energy losses. The surface eddy 
currents gradually diffuse inwards thereby allowing the magnetic field 
strength in the specimen to return to its original value. This diffusion 
of eddy currents, and, therefore, of magnetic flux, may be regarded 5 as 
analogous to the thermal diffusion which produces thermoelastic internal 
friction. It results in a peak in the curve of internal friction versus 
frequency ; this peak is not expressible as a single time of relaxation, 
however. The damping depends on the thickness of the specimen $d$, the
resistivity $\rho$, and the angular frequency $\omega$ only through the quantity
$d^2\omega/\rho$. In the low frequency limit, damping is proportional to this
quantity, and in the high frequency limit, the damping varies inversely
as the square root of this quantity. This type of internal friction is a
typical example of anelasticity, and shows the usual independence of
vibration amplitude. There is no damping due to macro-eddy currents
in a demagnetized specimen, because the application of stress to
such a specimen produces no gross magnetization. (This fact is a
consequence of the inability of stress to distinguish between two
directions 180° apart; if no preferential direction of magnetization
exists at zero stress, a finite stress cannot create one.) These characteristics
(the peak obtained as a function of frequency, the amplitude
independence, and the vanishing of the effect in a demagnetized
specimen) distinguish the internal friction due to macro-eddy currents
and make it possible to separate this type of damping from the others.

Micro-eddy Currents

Although the application of stress to a demagnetized specimen produces 
no gross magnetization, the fact that there are local variations of 
magnetization within the material, due to the domain structure, means 
that stress will produce local changes of magnetization. These local 
magnetization changes will add vectorially to zero, but the "micro-eddy
currents" that accompany them will be associated with finite total
energy losses. As in the macro-eddy current case, the damping that
results from micro-eddy currents is independent of vibration amplitude and
strongly frequency dependent. The two cases differ, however, in that
micro-eddy currents produce losses even at zero magnetization, and that
these losses are proportional to the frequency over a wide range of
frequency, i.e. internal friction does not go through a peak. Furthermore,
this source of damping is highly structure sensitive; according to
a calculation of BECKER and DÖRING, 138 internal friction varies inversely
as the square of the magnitude of internal stresses. In qualitative
agreement with this theory, various authors show 142, 143 that this type
of damping is reduced by cold working and increased by annealing.

Magnetoelastic Static Hysteresis 

When measurements are taken at sufficiently low frequencies, eddy 
current losses are effectively eliminated, because the rate of application 
of stress and, therefore, the rate of change of magnetization is very 
small. Under these conditions we obtain a stress-strain hysteresis loop
analogous to the hysteresis loop of magnetization versus field strength;
in both cases the occurrence of hysteresis is associated with sudden 
movements of domain boundaries which produce, irreversibly, a change 
in the size of one domain at the expense of another. This type of stress- 
strain loop is another example of the behaviour that we have termed 
static hysteresis. Characteristic of such behaviour is frequency independence,
and a strong dependence of damping on strain amplitude even
at low amplitudes of vibration. This type of magnetic damping is
discussed in detail by BOULANGER. 32

The similarity in the origin of the magnetization-field strength
hysteresis loop to that of the stress-strain loop means that the Rayleigh
law, which states that magnetic hysteresis losses are proportional to the
cube of the maximum field strength, applies equally well to the mechanical
hysteresis losses. The energy dissipated per cycle should, then, be
proportional to $\sigma_1^3$, where $\sigma_1$ is the stress amplitude of vibration. Since
the vibrational energy is proportional to $\sigma_1^2$, we find (from equation
(27)) that the internal friction is proportional to $\sigma_1$. This linear amplitude
dependence of damping at low amplitudes has been verified by
FÖRSTER and KÖSTER 144 and by SNOEK. 145 Correspondingly, it is shown 14*
that Young's modulus decreases linearly with $\sigma_1$. This result again
shows that for a mechanism of static hysteresis, both the damping and
the $\Delta M$ effect show the same form of amplitude dependence, as expected
from equations (49). As the vibration amplitude continues to increase,
the energy dissipated per cycle must deviate from Rayleigh's law and,
finally, approach a constant value for stress amplitudes sufficiently
large to produce a condition of saturation. Correspondingly, the
internal friction, which initially obeys a first power law, must go
through a maximum as a function of amplitude and, finally, obey an
inverse square law. The measurements of BOULANGER, shown in Fig. 11,
demonstrate this behaviour. From the difference between the internal
friction curves for zero field and for saturation, the energy loss $\Delta W_m$,
due to ferromagnetic effects, can be calculated (the dashed curve of
Fig. 11). This curve is initially a cubic law and eventually approaches a
constant value at large amplitudes. The fact that essentially the same
internal friction curve is obtained at different frequencies guarantees
that eddy current losses do not contribute to the observed damping.

Fig. 11. Internal friction of Armco iron as a function of amplitude of vibration, at zero field and at saturation. Points indicated by different symbols represent different vibration frequencies
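
The passage from a first power law to an inverse square law can be checked with a toy calculation. The sketch below assumes the usual definition of internal friction as the energy dissipated per cycle divided by $2\pi$ times the stored energy, and it uses an arbitrary interpolation for the energy loss that is cubic at low stress and constant at high stress. That interpolation, the unit constants and the function names are assumptions chosen only to reproduce the two limits described above. They are not a fit to BOULANGER'S data.

```python
import math

def energy_loss_per_cycle(sigma, a=1.0, sigma_sat=1.0):
    """Toy hysteresis loss: cubic (Rayleigh) at low stress, constant at saturation.

    The interpolation a*s^3 / (1 + (s/sigma_sat)^3) is an assumed form chosen
    only because it has the two limits described in the text.
    """
    return a * sigma ** 3 / (1.0 + (sigma / sigma_sat) ** 3)

def internal_friction(sigma, modulus=1.0):
    """Q^-1 = dW / (2*pi*W), with stored energy W = sigma^2 / (2*modulus)."""
    stored = sigma ** 2 / (2.0 * modulus)
    return energy_loss_per_cycle(sigma) / (2.0 * math.pi * stored)

def local_slope(f, x, factor=1.01):
    """Logarithmic derivative d(log f)/d(log x) estimated by a finite difference."""
    return (math.log(f(x * factor)) - math.log(f(x))) / math.log(factor)

low = local_slope(internal_friction, 1.0e-3)   # Rayleigh region
high = local_slope(internal_friction, 1.0e3)   # saturation region
print(f"slope at low amplitude:  {low:+.2f}")
print(f"slope at high amplitude: {high:+.2f}")
```

The logarithmic slope changes from about +1 to about -2, so a maximum must lie between the two regions, whatever the detailed form of the interpolation.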

OTHER SOURCES OF INTERNAL FRICTION 

The sources of damping already discussed by no means exhaust all 
possibilities, but are simply the ones that have been most thoroughly 
investigated up to the present. The ability of stress to induce atomic 
re-arrangements of various kinds seems to offer wide opportunities for 
further application of internal friction methods, particularly in the 
study of phase transformations. Such applications have, thus far, been 
only of a preliminary nature. 

An important type of phase transformation is the precipitation of a 
second phase, which occurs by a process of nucleation and growth. We 
have already discussed the effect of precipitation on internal friction 
when precipitation is discontinuous and results in the formation of 
internal interfaces. Studies of the precipitation of interstitial solutes in 
b.c.c. lattices, in terms of the rate of disappearance of solute atoms from 
solid solution, have also been discussed. No mention has been made, 
however, of any internal friction produced by the precipitate particles 
themselves, in the early stages of continuous precipitation. Such effects 
have been studied, for the most part, on age-hardenable engineering
materials; internal friction, as measured at one frequency and at room
temperature, may then be regarded as another mechanical property 
(like hardness) which varies with the degree of precipitation. On the 
other hand, fundamental studies of such damping effects, in relatively 
simple alloys, may reveal valuable information about the nature of the 
early stages of precipitation, that cannot be obtained by other methods. 
An investigation by KE 146 reveals an "anomalous" internal friction in 
aluminium containing ½ per cent copper, which he claims is related to
the precipitation process and to the presence of a state of cold work. 
Further experiments by the writer on this alloy indicate that cold 
working may not be necessary to this effect, and also show that the 
"anomalous" damping is difficult to reproduce. This phenomenon 
seems to be a sensitive measure of one stage in the precipitation 
process. 

Internal friction methods may also be of value in the case of phase 
transformations that do not occur by nucleation and growth. If the 
transformation is accompanied by a change of dimensions, the application
of stress to the material will induce or retard the transformation at
temperatures in the immediate vicinity of the transformation tempera- 
ture. Stated differently, the lattice is relatively unstable near the 
transformation temperature and may be greatly influenced by an 
external stress. Large non-elastic deformation, or plasticity, and large 
internal friction effects are, therefore, expected in the vicinity of the 
transformation temperature. Such behaviour has been observed for the 
$\beta$-phase transition in Ag-Zn alloys, 97 for the $\lambda$-point transition in ammonium
chloride, 147 and for the $\alpha$ to $\gamma$ transformation in ferronickels. 148, 149
The mechanism of this type of internal friction is, of course, closely
related to the mechanism of the particular phase transformation, and,
therefore, deserves further detailed investigation. 
2

THE MECHANISM OF OXIDATION OF 

METALS AND ALLOYS AT HIGH 

TEMPERATURES 

Karl Hauffe* 

INTRODUCTION 

THE reaction between a solid metal and oxygen or an oxidizing gas 
(e.g. a halogen or sulphur) seems to be one of the simplest chemical 
reactions. Experiments show, however, that the mechanism is rather 
complex, because in many cases the reaction product separates the 
reactants and forms a compact layer of oxide or halide on the metal 
surface. The oxidation can advance only as fast as the reactants diffuse 
across the oxide layer. It is therefore of interest to investigate the 
structure of these layers and the mechanism whereby atoms, ions and 
electrons diffuse in them. Assuming that there is no diffusion through
pores or along grain boundaries, WAGNER 79 made the working hypo- 
thesis that only cations and anions migrate across the compact layer, 
and that the migration of electrically neutral atoms or molecules may 
be neglected. In the next chapter we shall consider the laws of motion 
of ions and electrons within the crystal. We shall see that WAGNER'S
theory of tarnishing reactions 79-81, 86 and its applications to alloys by
WAGNER 82, 83 and HAUFFE and associates 30, 31, 20, 21 have some success in
explaining experimental observations. 

The interconnexion of diffusion processes and phase boundary 
reactions is pictured in Fig. 1. Sometimes, the reactions at the phase 
boundaries may be rate-determining factors. In most cases, however, 
the migration of ions controls the rate of oxidation. On this basis it may 
be assumed either that metal ions and electrons migrate from the inter- 
face between metal and oxide across the oxide to the outer surface or 
that oxygen ions migrate in the opposite direction. 

Reactions of metals and alloys with oxygen, sulphur and halogens at
high temperatures are called tarnishing reactions. We must distinguish
between oxidation processes leading to thick oxide layers and those
leading to thin (or very thin) films. In the first case it is to be expected
that the rate of oxidation is inversely proportional to the thickness of
the oxide layer (so that the thickness grows as the square root of the time) and that
electroneutrality holds. Under these conditions WAGNER'S theory is
valid. At the other extreme, in very thin films, there is a diffuse electric
double layer, where the electrical field strength is so high that the rate of
migration of ions is no longer proportional to the field strength, i.e. it is
no longer inversely proportional to the thickness of the oxide film, but
depends quasi-exponentially upon the thickness. MOTT 65 was the first
to show that under these conditions the rate of oxidation will decrease
more rapidly with increasing thickness of the oxide film than is accounted
for by the parabolic law. This is one of the various ways of interpreting
the so-called logarithmic rate law. In Chapter 3.1 the formation of thin
films will be discussed in general, using the theories of CABRERA and
MOTT 5 and ENGELL and HAUFFE. 12

Fig. 1. Migration processes and phase boundary reactions during the oxidation of a metal forming a defect semi-conducting oxide layer. Regions labelled in the figure: Metal, Oxide, Oxygen. (Here $\oplus$ and $\mathrm{Me}\square'$ represent an electron hole and a metal ion vacancy, respectively. The expression "electron hole" indicates that somewhere in the lattice of the compound an electron is missing. Fig. 1 shows that electrons migrate by filling electron holes and ions migrate by filling ion vacancies)

* At present Professor of Physical Chemistry at the University of Berlin, Germany.

For thick layers, where the concentrations or activities of the
reactants at the two boundaries of the oxide layer are independent of
time, and thermodynamic equilibrium at the respective interfaces is
practically established, the reaction rate obeys Fick's law, giving the
increase of thickness $\Delta x$ per unit time $t$,

$$\frac{d(\Delta x)}{dt} = \frac{k'}{\Delta x} \qquad (1)$$

where $k'$ is a constant. Upon integration,

$$(\Delta x)^2 = 2k't \qquad (2)$$

Instead of measuring the thickness it is often easier to determine the
increase of mass per unit area, $\Delta m/A$, in grammes per cm$^2$. Equation (2) then leads to

$$\left(\frac{\Delta m}{A}\right)^2 = 2k''t \qquad (3)$$

where $k''$ is the corresponding constant expressed in terms of mass.
This is the parabolic rate law, which was first derived by TAMMANN 73 in
1920 and independently by PILLING and BEDWORTH 67 in 1923.
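
The step from the rate equation to the parabolic law can be verified numerically. The sketch below integrates $d(\Delta x)/dt = k'/\Delta x$ by simple Euler steps, starting from a thin seed layer so that the rate is finite at $t = 0$, and compares the result with the closed form. The rate constant, the seed thickness and the function names are hypothetical values chosen for illustration only.

```python
import math

def integrate_thickness(k_prime, x0, t_end, steps=100_000):
    """Euler integration of d(dx)/dt = k' / dx starting from a thin seed layer x0."""
    dt = t_end / steps
    x = x0
    for _ in range(steps):
        x += k_prime / x * dt
    return x

def parabolic_thickness(k_prime, t):
    """Closed form of the parabolic law: dx = sqrt(2 k' t)."""
    return math.sqrt(2.0 * k_prime * t)

k_prime = 1.0e-12  # cm^2/s, hypothetical rate constant
t_end = 3600.0     # one hour, in seconds
x0 = 1.0e-6        # cm, hypothetical seed layer
numeric = integrate_thickness(k_prime, x0=x0, t_end=t_end)
exact = math.sqrt(x0 ** 2 + 2.0 * k_prime * t_end)  # closed form including the seed
print(f"numeric {numeric:.4e} cm, analytic {exact:.4e} cm")
print(f"pure parabolic law gives {parabolic_thickness(k_prime, t_end):.4e} cm")
```

Once the layer is much thicker than the seed, the seed term is negligible and the numerical result follows $\sqrt{2k't}$ closely.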
THE FORMATION OF THICK COMPACT LAYERS OF OXIDE
ON A METAL SURFACE 

A compact layer is formed, if the molar volume of the oxide is equal 
to or greater than that of the metal. This general rule is only a rough 
one and requires modifying in view of experimental results. The 
theoretical basis for understanding the departure from this rule has 
been given by FRANK and VAN DER MERWE. 17 According to these
authors it is reasonable to assume that, if the degree of misfit is less 
than about 15 per cent, the layer will take up the lattice parameter of 
the metal. If it is, however, greater than about 15 per cent, it will take 
up very nearly its own unstrained lattice parameter. In the latter 
case the layer can break away* from the surface or become porous. 
In the first part of this report we shall confine ourselves to thick 
compact layers. In 4-3 thin layers will be dealt with. 

Lattice Defects in Ionic Crystals 

Motion of ions in a crystal lattice is understandable, if one assumes, as
FRENKEL has done, that not all ions are in their normal lattice positions. 18
In pure silver halides some cations have left their normal lattice points to
occupy interstitial positions; in this way one cation vacancy appears per
interstitial ion, as is shown in Fig. 2. The anion lattice is practically
undisturbed. In a concentration gradient or a weak electrical field the
silver ions can move via interstitial positions or via vacancies.
FRENKEL'S assumptions explain the experimental results well. TUBANDT showed
that, in silver halides, practically only silver ions are mobile. 75, 76
Furthermore, energy calculations led JOST 51, 52 to the conclusion that
for the silver halides less energy is required to move a silver ion from a
normal to an interstitial site than to form a pair of vacancies (proposed
by SCHOTTKY for alkali halides and discussed below) by moving a cation
and an anion to the surface. More detailed calculations by MOTT and
LITTLETON 63 somewhat modified this. Experiments involving the
addition of the halides of divalent metals to silver halides, first carried
out by KOCH and WAGNER, 65 have confirmed JOST'S viewpoint.
TELTOW 74 has repeated these experiments with a considerable increase
in experimental accuracy and also in the number of determinations
with additions of divalent metal salts such as the halides of lead and
cadmium.

Fig. 2. Lattice defects in silver bromide. According to FRENKEL (vacant silver sites indicated by a square)

Fig. 3 shows another type of defect, first discussed by SCHOTTKY. 71
Under the influence of thermal motion cations and anions near the
surface of the alkali halides leave their normal lattice points, diffusing
to the surface and leaving vacancies behind. These vacancies can
move in the lattice and their existence thus makes the normal ions more
mobile. Experiments to determine transport numbers show that the
conductivity is entirely ionic in character and due to the migration of
both cations and anions. The temperature coefficient of mobility will
usually be different for the two ions. The energy required to form a
vacancy was first computed by JOST and NEHLEP 53 and later investigated
by more refined methods by MOTT and LITTLETON. 63 SCHOTTKY defects seem
to be the rule also in some other crystals, e.g. in lead halides and some
sulphides and oxides.

Fig. 3. Lattice defects in sodium chloride. According to SCHOTTKY

These types of lattice defects are only to be found in stoichiometric
crystals. The WAGNER-SCHOTTKY disorder theory of crystals explains
experimental observations and, moreover, its application led to further
fruitful experiments. Thus WAGNER and co-workers 81 have developed
the basic principles of ionic crystals possessing deviations from the ideal,
integral, stoichiometric composition. In solids like Cu₂O there are
cation vacancies $\mathrm{Cu}\square'$ and electron holes $\oplus$. From the
corpuscular viewpoint, therefore, an electron hole is represented by a
Cu²⁺ ion (see Fig. 4).

Fig. 4. Lattice defects in cuprous oxide. According to WAGNER. Squares
indicate vacant copper ion lattice points. A divalent copper ion
represents an electron hole

In the presence of oxygen, cuprous ions diffuse from the interior
to the surface, there combining with chemisorbed oxygen which, on
account of its electron affinity, has captured electrons, forming more
Cu₂O molecules at the surface. It is, therefore, evident that the number
of Cu⁺ vacancies and electron holes increases with increasing oxygen
pressure. Since in most cases the concentration of lattice defects is
relatively low, one may tentatively apply the laws of ideal solutions.
Under these assumptions we can write

\[ \tfrac{1}{2}\,\mathrm{O_2^{(g)}} \rightleftharpoons \mathrm{Cu_2O} + 2\,\mathrm{Cu}\square' + 2\,\oplus \qquad (4) \]

Applying the law of mass action we have 

\[ c_{\mathrm{Cu}\square'}^{2} \cdot c_{\oplus}^{2} = K\, p_{\mathrm{O_2}}^{1/2} \qquad (5) \]

where K is the equilibrium constant and $c_{\oplus}$ is the concentration of
electron holes, which is equal to the concentration of cuprous ion
vacancies $c_{\mathrm{Cu}\square'}$. Since the electrical conductivity $\kappa$ is
proportional to $c_{\oplus}$, we have

\[ \kappa = \mathrm{const} \cdot p_{\mathrm{O_2}}^{1/8} \qquad (6) \]
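
The exponent in equation (6) can be checked with a few lines of code. The sketch below is only illustrative: it assumes ideal mass action as in equation (5), equal concentrations of vacancies and electron holes, and an equilibrium constant `K` whose value is arbitrary (it is not taken from the text and does not affect the slope).

```python
import math

def hole_concentration(p_o2, K=1.0e-12):
    """Hole concentration from c_vac**2 * c_hole**2 = K * p_o2**0.5
    with c_vac == c_hole. K is an arbitrary illustrative constant."""
    return (K * math.sqrt(p_o2)) ** 0.25

def pressure_exponent(p_low, p_high, K=1.0e-12):
    """Slope of ln(c_hole) against ln(p_O2) between two pressures."""
    c_low = hole_concentration(p_low, K)
    c_high = hole_concentration(p_high, K)
    return (math.log(c_high) - math.log(c_low)) / (math.log(p_high) - math.log(p_low))

exponent = pressure_exponent(1.0e-4, 1.0e-1)
print(f"d ln(c) / d ln(p_O2) = {exponent:.4f}")  # to be compared with 1/8
```

Replacing the fourth root by another power gives the exponent for other defect equilibria treated in the same way.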


WAGNER and associates 9, 27 find $\kappa \propto p_{\mathrm{O_2}}^{1/7}$, in good
agreement with theory. GUNDERMANN and WAGNER 28 showed that the conductivity
due to cuprous ions is also proportional to $p_{\mathrm{O_2}}^{1/7}$. NiO behaves
in a similar manner. Semi-conductors in which the current is carried by
electron holes are called electron-defect semi-conductors (or
p-semi-conductors). Such conductors are Cu₂O, NiO, 3 FeO, 85 CoO, 85
Bi₂O₃, 57 Cr₂O₃. 33

Solids containing an excess of metal are in distinct contrast to the
above group of oxides. A representative of this group is zinc oxide.
An excess of metal may be present in the form of zinc ions in interstitial
positions and an equivalent number of quasi-free electrons $\ominus$, as
shown in Fig. 5. At high temperatures ZnO dissociates as follows

\[ \mathrm{ZnO} \rightleftharpoons \mathrm{Zn}\bigcirc^{\cdot} + \ominus + \tfrac{1}{2}\,\mathrm{O_2^{(g)}} \qquad (7a) \]

or

\[ \mathrm{ZnO} \rightleftharpoons \mathrm{Zn}\bigcirc^{\cdot\cdot} + 2\,\ominus + \tfrac{1}{2}\,\mathrm{O_2^{(g)}} \qquad (7b) \]

Fig. 5. Tentative scheme of lattice defects in zinc oxide as proposed by
WAGNER

In these equations $\mathrm{Zn}\bigcirc^{\cdot}$ and $\mathrm{Zn}\bigcirc^{\cdot\cdot}$,
respectively, represent an interstitial zinc ion, the point to the right
of the circle indicating the positive net-charge. According to MILLER'S
self-diffusion measurements at 1000°C 61 the assumption of the existence
of interstitial zinc is better justified than is the existence of oxygen
ion vacancies. Again applying the law of mass action we obtain for the
electrical conductivity

\[ \kappa = \mathrm{const} \cdot p_{\mathrm{O_2}}^{-1/4} \qquad (8a) \]

\[ \kappa = \mathrm{const} \cdot p_{\mathrm{O_2}}^{-1/6} \quad \text{respectively} \qquad (8b) \]

In agreement with (8) it is found experimentally that the electrical
conductivity of zinc oxide decreases with increasing partial pressure of
oxygen at 600°C 2; $\kappa \propto p_{\mathrm{O_2}}^{-1/4}$ to
$p_{\mathrm{O_2}}^{-1/5}$. CdO, 2 TiO₂, 10, 34 Al₂O₃ 29
behave similarly. The special problems encountered in interpreting
the observed properties of intrinsic semi-conductors such as CuO 35 will
not be discussed, because it is not yet known what type of lattice
defects prevail in this case.

Wagner's Theory of Oxidation of Metals 

Following WAGNER we shall now derive the general formula of high- 
temperature oxidation of metals, which expresses the rate of oxidation 
in terms of the free energy of oxidation, the electrical conductivity, 
and the transport numbers of ions and electrons. Assuming that the 
ions and electrons migrate independently of each other, one obtains
for $u_i$, the average drift velocity in cm s⁻¹ of particles of type i in
the direction of the x-axis



where $B_i$ represents the mobility of particles of type i, i.e. the
steady-state velocity due to a unit force per particle, corresponding to a
change in the chemical potential per particle of one unit of energy
(1 erg) for a displacement of one unit of length (1 cm). N is
Avogadro's number, e is the electronic charge, $\varphi$ is the local electrical
potential, $z_i$ is the electrical valency and $\mu_i$ is the chemical potential
of a mole of particles of type i in ergs per mole.

If $n_i/A$ is the number of moles migrating per unit cross section per
unit time, then there follows from equation (9)



where $c_i$ is the concentration of particles of type i in moles per unit
volume (= 1 cm³).

As it is often desirable to eliminate $B_i$ we introduce the electric
current I (in amperes) and the electric conductivity $\kappa$ (ohm⁻¹ cm⁻¹) in
combination with the electric field strength ($d\varphi/dx$) in absolute units
and the factor 300 to convert to practical units by means of the relation

dcf> 

I = ,4*300 -- 
dx 

In the case of electrolysis $d\mu_i/dx$ is zero, and we obtain for the
transport of ions

j_j j __ __ . i K . _j_. (1) 

\AJ electrolysis 96500 l dx 

where $t_i$ is the transport number of particles of type i. Comparing
equations (10) and (12) we get

*, = J00_ . __ /_ d^._ d-A 

A 96500 \z t \eN\ dx ^ dx]' ' ' v 

In view of the chemical equilibrium between ions, electrons and elec- 
trically neutral atoms, we have 



\[ \mu_{\mathrm{X}} = \mu_2 + z_2\,\mu_3 \qquad (14b) \]

where $\mu_{\mathrm{Me}}$ and $\mu_{\mathrm{X}}$ are the chemical potentials of the
electrically neutral reactants, the metal Me and the non-metal X,
respectively. These chemical potentials $\mu_{\mathrm{Me}}$ and $\mu_{\mathrm{X}}$ are
not independent of each other. According to the special form of the
Gibbs-Duhem equation, we have in the case of a small deviation from a
stoichiometrically composed solid

\[ d\mu_{\mathrm{Me}} = -\frac{z_1}{|z_2|}\, d\mu_{\mathrm{X}} \qquad (15) \]

On the basis of these equations WAGNER derives the following 
equation for the formation of an oxide, halide or sulphide 



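\[ \frac{n_{equ}}{A} = \left\{ \frac{300}{96500}\,\frac{1}{Ne} \int_{\mu_{\mathrm{X}}'}^{\mu_{\mathrm{X}}''} (t_1 + t_2)\, t_3\, \kappa\, \frac{1}{|z_2|}\, d\mu_{\mathrm{X}} \right\} \frac{1}{\Delta x} = k_r\, \frac{1}{\Delta x} \qquad (16) \]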

where $n_{equ}/A$ is the rate of both outward migration of metal ions and
inward migration of non-metal ions expressed in equivalents per
square centimetre per second, $\Delta x$ is the instantaneous thickness of the
reaction product in centimetres, e is the electronic charge in absolute
electrostatic units, $z_2$ is the valence of negatively charged anions,
$\mu_{\mathrm{X}}$ is the chemical potential of the non-metal in ergs per gram
atom, where the single dash and the double dash following the $\mu_{\mathrm{X}}$
indicate the equilibrium between the reaction product and the metal and
between the reaction product and the non-metal, respectively; $t_1$, $t_2$
and $t_3$ are the transport numbers of metal ions, non-metal ions and
electrons, respectively, and the constant $k_r$ is called the "rational
rate constant," i.e. the reaction rate in equivalents per centimetre per
second for a layer of the reaction product of 1 cm thickness.

Recently WAGNER 81 has derived a formula, similar to equation (16),
which contains only the self-diffusion coefficients $D_1^{*}$ and $D_2^{*}$ of
metal and non-metal ions, respectively. Using EINSTEIN'S equation,

\[ D_i^{*} = B_i\, kT \qquad (17) \]

WAGNER finally obtains the following expression for the rational rate 
constant k r 



. . . . (18 



= c f" A "' (A* + ^ D 

Ja ltf , \ Z l 

e' L 



where $c = c_1 |z_1| = c_2 |z_2|$ is the concentration of metal or non-metal
ions in equivalents per cubic centimetre and $a_{\mathrm{Me}}$ and $a_{\mathrm{X}}$
are the corresponding activities. This equation, however, is fulfilled
only if the transport number of electrons $t_3$ is practically equal to
unity.

Both equations (16) and (18) are useful for the calculation of the
rate constant of oxidation reactions. To use equation (16) the electrical
conductivity and the transport numbers of ions and electrons as
functions of the partial pressure of the non-metal are required. Use of
equation (18) is recommended if the determination of self-diffusion
coefficients is easier than the determination of transport numbers.

As already pointed out by HAUFFE, 30, 36 on the basis of WAGNER'S
theory of oxidation, the chemical viewpoint, i.e. the distinction between
oxidation, halogenation and sulphuration reactions, is not of paramount
importance. We must, however, clearly distinguish between
oxidation processes in which the intermediate layer is mainly an
electronic conductor and those in which it is mainly an ionic conductor.
In the former case we must also distinguish between conductors in
which the current is carried by quasi-free electrons and conductors in
which the current is carried by electron holes. This distinction is of
decisive importance for tarnishing reactions.

According to the general equation (16) the electronic conductivity 
is the rate determining factor if the intermediate layer is an ionic 
conductor. An example is the formation of AgBr on a silver surface 
exposed to bromine vapour. Here the rate of bromination will decrease
with decreasing electronic conductivity and decreasing concentration 
of electron holes. In contrast to these systems with an ionic conducting 
layer the rate of oxidation can be increased or decreased in systems with 
an electronic conductor as intermediate layer if the concentration of the 
lattice defects is altered, for in this case the ionic conductivity is the 
rate determining factor. 

Oxidation Processes Involving Ionic Conducting Layers 

Consider the reaction between silver and gaseous bromine. According 
to the general scheme silver ions and electrons migrate from the inter- 
face between Ag and AgBr across the bromide to the outer surface and 
here they react with adsorbed bromine atoms. Whereas the concentra- 
tion of silver vacancies and silver ions in interstitial positions is 
independent of the vapour pressure of the bromine, the concentration 
of electron holes is a function of the partial vapour pressure of the 
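bromine. According to WAGNER 80 we have

\[ \tfrac{1}{2}\,\mathrm{Br_2^{(g)}} \rightleftharpoons \mathrm{AgBr} + \mathrm{Ag}\square' + \oplus \qquad (19) \]

Applying the law of mass action we obtain for the hole conductivity

\[ \kappa_{\oplus} = \kappa_{\oplus\,(p_{\mathrm{Br_2}} = 1)}\; p_{\mathrm{Br_2}}^{1/2}, \qquad (20) \]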

and finally by combining with equation (16) for the rate of bromination

\[ \frac{n_{\mathrm{AgBr}}}{A} = \left\{ \frac{300}{96500}\,\frac{RT}{Ne}\,\kappa_{\oplus\,(p_{\mathrm{Br_2}} = 1)}\; p_{\mathrm{Br_2}}^{1/2} \right\} \frac{1}{\Delta x} \qquad (21) \]

if $p_{\mathrm{Br_2}} = p_{\mathrm{Br_2}}'' \gg p_{\mathrm{Br_2}}'$. In agreement with this,
WAGNER was able to show that between 200 and 400°C the rate of bromination
is proportional to the square root of the bromine vapour pressure.
Furthermore the transport number $t_3$ of electron holes at 200°C and
$p_{\mathrm{Br_2}}$ = 0.23 atm is expected from bromination experiments to be
about 17 per cent, whereas from conductivity measurements it was found to
be 12 per cent. 80 This result is in satisfactory agreement with WAGNER'S
assumptions. The same mechanism may be assumed to apply to the reaction
of silver with chlorine.
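
The square-root law can be reproduced by integrating the general rate expression (16) numerically across the bromide layer. The following is a minimal sketch under stated assumptions, not WAGNER'S own calculation: it takes $(t_1 + t_2)\,t_3\,\kappa$ to be the hole conductivity of equation (20), a univalent anion, and $d\mu_{\mathrm{Br}} = \tfrac{1}{2}RT\,d\ln p_{\mathrm{Br_2}}$. The conductivity, temperature and pressures passed in are hypothetical numbers chosen only to exercise the function.

```python
import math

R_GAS = 8.314e7        # gas constant, erg mol^-1 K^-1
N_AVOGADRO = 6.022e23  # mol^-1
E_CHARGE = 4.803e-10   # electronic charge, electrostatic units

def bromination_rate_constant(p_outer, p_inner, kappa_hole_1atm, temp_k, steps=20000):
    """Trapezoidal integration of kappa_hole d(mu_Br) over the layer.

    Assumes kappa_hole = kappa_hole_1atm * p**0.5 and
    d(mu_Br) = 0.5 * R * T * d(ln p). Returns equivalents cm^-1 s^-1.
    """
    ln_lo, ln_hi = math.log(p_inner), math.log(p_outer)
    h = (ln_hi - ln_lo) / steps
    total = 0.0
    for i in range(steps + 1):
        p = math.exp(ln_lo + i * h)
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * kappa_hole_1atm * math.sqrt(p) * 0.5 * R_GAS * temp_k * h
    return 300.0 / 96500.0 * total / (N_AVOGADRO * E_CHARGE)

# Hypothetical inputs: inner pressure negligible against the outer one.
k_low = bromination_rate_constant(0.05, 1e-12, kappa_hole_1atm=1e-6, temp_k=573.0)
k_high = bromination_rate_constant(0.20, 1e-12, kappa_hole_1atm=1e-6, temp_k=573.0)
ratio = k_high / k_low
print(ratio)  # to be compared with sqrt(0.20 / 0.05)
```

Quadrupling the outer bromine pressure should roughly double the rate constant as long as the inner pressure remains negligible.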

Oxidation Processes Involving Excess Semi-conducting Layers

Next, we consider the reaction between zinc and oxygen. Experiments
showed that the electric conductivity of zinc oxide is mainly due to
quasi-free electrons and, to a lesser extent, to interstitial zinc ions.
Consequently, in contrast to the halogenation of silver, the rate
determining factor in the oxidation of zinc is the migration of zinc ions
via interstitial positions, i.e. the ionic conductivity. Applying the law
of mass action to equation (7)

\[ c_{\mathrm{Zn}\bigcirc^{\cdot}} \cdot c_{\ominus} = K\, p_{\mathrm{O_2}}^{-1/2}; \qquad (22) \]

and putting $c_{\mathrm{Zn}\bigcirc^{\cdot}} = c_{\ominus}$, we obtain for the
conductivity due to zinc ions

\[ \kappa_{\mathrm{Zn}\bigcirc^{\cdot}} = \kappa_{\mathrm{Zn}\bigcirc^{\cdot}\,(p_{\mathrm{O_2}} = 1)}\; p_{\mathrm{O_2}}^{-1/4} \qquad (23) \]

Proceeding in the same way as in the derivation of equation (21) we
finally obtain for the rate of oxidation

I 300 



A 196500 



(24) 

(24) 



It follows from equation (24) that the rate of oxidation of zinc is
independent of the partial pressure of oxygen. This conclusion was
experimentally confirmed by WAGNER and GRUENEWALD 86 and can easily be
understood. Fig. 6 shows the distribution of interstitial zinc ions and
of quasi-free electrons in the lattice. Near the oxide/metal interface
the oxide is rich in interstitial zinc ions and quasi-free electrons,
whereas their concentration at the outer surface is very small. It is,
therefore, apparent that the concentration gradient of
$\mathrm{Zn}\bigcirc^{\cdot}$ in zinc oxide, and thus the rate of oxidation of
zinc, should be almost independent of the oxygen pressure.

Fig. 6. Concentration gradients in zinc oxide at oxygen pressures of
0.1 atm (Curve 1) and 0.001 atm (Curve 2)

The rate constant of zinc oxidation, as found by WAGNER and
GRUENEWALD, is $k_r$ = 7.5 · 10⁻¹¹ g² cm⁻⁴ hr⁻¹ at 400°C. In contrast to
this, the rate constant of oxidation of spectroscopically pure zinc under
exactly similar experimental conditions was found by GENSCH and
HAUFFE 20 to be $k_r$ = 9 · 10⁻¹⁰ at 390°C. It must be assumed that the
high value of $k_r$ is a sign of great purity of the zinc used. The oxidation
of zinc alloys, mentioned below, will show that this conclusion is
justified.

It is to be expected that the oxidation of cadmium proceeds according
to the same mechanism, for FAIVRE and MICHEL 14 found that an
increasing excess of metal leads to an increase of the lattice parameter.
This as well as the dependence of the electric conductivity on the
partial pressure of oxygen makes it probable that there are metal
ions in interstitial positions, and quasi-free electrons. Conductivity
measurements carried out by EARLE, 10 and HAUFFE and associates 34
on TiO₂ show that this, too, is an excess semi-conductor; however, it
contains no interstitial cations, but oxygen ion vacancies as shown by
EHRLICH. 11 GULBRANSEN and associates 22 have investigated the
dependence of the rate of oxidation of titanium on oxygen pressure.

Oxidation Processes Involving Deficit Semi-conducting Layers

Good examples of this type of process are the reactions of iron and
copper with oxygen, the wüstite-phase and cuprous oxide having been
investigated in particular by JETTE and FOOTE, 49 DARKEN and GURRY, 8
and by WAGNER and associates. 85 Unfortunately the rate of oxidation
of iron to FeO at 950 and 1000°C is determined by surface reactions. 39
We shall, therefore, rather consider the oxidation of copper investigated
by WAGNER and GRUENEWALD. 86

Since Cu₂O is an electronic conductor, the velocity of diffusion of
copper ions and electrons is determined by the more slowly moving
copper ions, moving via vacancies. According to equation (16) the
copper ion conductivity $\kappa_{\mathrm{Cu}} = \kappa \cdot t_1$ is the rate
determining factor. Using the experimental result

\[ \kappa_{\mathrm{Cu}} = \kappa_{\mathrm{Cu}\,(p_{\mathrm{O_2}} = 1)}\; p_{\mathrm{O_2}}^{1/7} \qquad (25) \]

and the relation

\[ d\mu_{\mathrm{O}} = \tfrac{1}{2}\, d\mu_{\mathrm{O_2}} = \tfrac{1}{2}\, RT\, d\ln p_{\mathrm{O_2}} \qquad (26) \]

we obtain for the rate of oxidation of copper 

tt Gu o 300 7 RT 7 _ __, 1 

~Z' = 9^* C * PO -- 1) 4^<VP : ~ Vpo/) ^ ( 27fl ) 
or 



The mobility of the cation vacancies can be calculated from the electric
conductivity, the transport number of the cations, and the copper
deficit in cuprous oxide. These data have been used by WAGNER and
GRUENEWALD to calculate the values of the rational rate constant $k_r$
as indicated in Table 1. Observed and calculated values of $k_r$ are in good
agreement. Furthermore, the experimental observations led to
$k_r \propto p_{\mathrm{O_2}}^{1/7}$, in fairly good agreement with the theoretically
expected relation $k_r \propto p_{\mathrm{O_2}}^{1/8}$ (see Fig. 7).

TABLE 1

Calculated and Measured Rate Constants of the Formation of
Cuprous Oxide on Copper at 1000°C as measured by WAGNER
and GRUENEWALD

Oxygen Pressure      Rational Rate Constant k_r (equivalents . cm⁻¹ . s⁻¹)
in atm               Calculated          Observed

8.3 . 10⁻²           6.6 . 10⁻⁹          6.2 . 10⁻⁹
1.5 . 10⁻²           4.8 . 10⁻⁹          4.5 . 10⁻⁹
2.3 . 10⁻³           3.4 . 10⁻⁹          3.1 . 10⁻⁹
3.0 . 10⁻⁴           2.1 . 10⁻⁹          2.2 . 10⁻⁹

Fig. 7. Dependence of the rate of oxidation of copper on the partial
pressure of oxygen at 1000°C according to WAGNER and GRUENEWALD
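
As a rough check on the 1/7 power law, the observed values of Table 1 can be fitted against the seventh root of the oxygen pressure. The sketch below is illustrative only; it assumes that every observed entry carries the factor 10⁻⁹ (the exponent of the first entry is not legible in the table as reproduced), and the helper `linear_fit` is a hypothetical name introduced for this example.

```python
import math

# Observed k_r values from Table 1 (equivalents cm^-1 s^-1) at 1000 C.
# Assumption: all observed entries carry the factor 10^-9.
pressures_atm = [8.3e-2, 1.5e-2, 2.3e-3, 3.0e-4]
k_observed = [6.2e-9, 4.5e-9, 3.1e-9, 2.2e-9]

def linear_fit(xs, ys):
    """Least-squares line y = slope * x + intercept and correlation r."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x, sxy / math.sqrt(sxx * syy)

seventh_roots = [p ** (1.0 / 7.0) for p in pressures_atm]
slope, intercept, r = linear_fit(seventh_roots, k_observed)
print(f"slope = {slope:.3e}, intercept = {intercept:.3e}, r = {r:.4f}")
```

A negative intercept is what one would expect if the oxygen pressure at the metal/oxide interface contributes a constant term that is subtracted from the outer-pressure term.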

SHOCKLEY and associates, 1 and CASTELLAN and MOORE, 7 have 
investigated the self-diffusion of copper in cuprous oxide which was 
growing on copper but unfortunately the cuprous oxide was not 
homogeneous. Therefore their self-diffusion coefficients cannot be used 
in WAGNER'S third formula (27c) for the rate of oxidation of copper 
(cf. equation (18)), namely 

*r = (1 + ZCuO<^' {1 ~ ('CUQ'/CCUIJ')} ...- (27C) 

where $D_{\mathrm{Cu}}^{*}$ is the self-diffusion coefficient of copper ions in
cuprous oxide in equilibrium with oxygen.


In the same manner WAGNER and GRUENEWALD have investigated
the oxidation of nickel and found $k_r \propto p_{\mathrm{O_2}}^{1/4}$, whereas
theoretically $k_r \propto p_{\mathrm{O_2}}^{1/6}$ should be obtained (see equations
(48) and (49)). The discrepancy between experimental results and
theoretical expectation is not yet understood. Recently HAUFFE and
RAHMEL 37 have investigated the reaction between iron and sulphur at
670°C. According to the equation

\[ \tfrac{1}{2}\,\mathrm{S_2^{(g)}} \rightleftharpoons \mathrm{FeS} + \mathrm{Fe}\square'' + 2\,\oplus \qquad (28) \]

the rate of sulphuration of iron is a linear function of the 1/4 and 1/7
power of the sulphur vapour pressure (see Fig. 8). The theoretical
relation is $k_r \propto p_{\mathrm{S_2}}^{1/6}$. The agreement is fairly good.

Fig. 8. Dependence of the rate of sulphuration of iron on the
partial pressure of sulphur at 670°C according to HAUFFE and
RAHMEL (k in g² . cm⁻⁴ . s⁻¹)

APPLICATION OF WAGNER'S THEORY OF OXIDATION TO THE 
OXIDATION OF ALLOYS 

In the above chapters we considered the oxidation of pure metals and 
developed quantitative relations between transport of both ions and 
electrons and the rate of oxidation of metals. In this chapter we shall 
consider the influence of the additions of small amounts of a foreign 
metal on the rate of oxidation. Accepting WAGNER'S theory of oxida- 
tion and assuming that the mobilities of the disordered ions and electrons 
will not be markedly changed by the presence of foreign ions, we must 
diminish the concentration of ionic or electronic defects in order to 
obtain a decrease in the rate of oxidation. Conversely the velocity of 
oxidation will increase on increasing the concentration of ionic defects, 
electron holes or quasi-free electrons. To understand this we must 
consider the application of the law of mass action to ionic and electronic 
lattice defects in mixed crystals. 

Oxidation Processes Involving Ionic Conducting Layers 

WAGNER, 87 HAUFFE and associates 32-35, 38, 40 have shown that the
concentration of vacancies and ions in interstitial positions, or of
electron holes and quasi-free electrons, and likewise the electrical
conductivity can be greatly varied by the addition of ions of higher or
lower valencies. This will be shown for the case of silver bromide. For
a FRENKEL type of defect we have the following equilibrium

\[ \mathrm{AgBr}_{\text{undisturbed}} \rightleftharpoons \mathrm{Ag}\square' + \mathrm{Ag}\bigcirc^{\cdot}, \qquad (29) \]

and with the law of mass action we obtain

\[ c_{\mathrm{Ag}\square'} \cdot c_{\mathrm{Ag}\bigcirc^{\cdot}} = K. \qquad (30) \]

This equation is valid for pure silver bromide and for silver bromide
containing small quantities of another metal bromide. In the case of pure
silver bromide we have the special condition

\[ c_{\mathrm{Ag}\square'}^{\circ} = c_{\mathrm{Ag}\bigcirc^{\cdot}}^{\circ} = K^{1/2}. \qquad (31) \]

If cadmium bromide is dissolved in silver bromide, divalent cadmium ions
replace silver ions and one cation vacancy $\mathrm{Ag}\square'$ appears for each
divalent cadmium ion replacing a silver ion in the lattice,
$\mathrm{Cd}\bullet^{\cdot}(\mathrm{Ag})$ (see Fig. 9). The equation for the solution of
cadmium bromide in silver bromide then reads

\[ \mathrm{CdBr_2} \rightleftharpoons \mathrm{Cd}\bullet^{\cdot}(\mathrm{Ag}) + \mathrm{Ag}\square' + 2\,\mathrm{AgBr}. \qquad (32) \]

Fig. 9. Lattice defects in a mixed crystal of AgBr-CdBr₂ as proposed by
WAGNER

Consequently, because of electro-neutrality, in a binary solution the
number of cation vacancies is equal to the number of cadmium ions
plus the number of silver ions in interstitial positions

\[ c_{\mathrm{Ag}\square'} = c_{\mathrm{Ag}\bigcirc^{\cdot}} + c_{\mathrm{Cd}\bullet^{\cdot}(\mathrm{Ag})}. \qquad (32) \]

Using (30), (31) and (32) we have 



where $c_{\mathrm{Cd}} = c_{\mathrm{Cd}\bullet^{\cdot}(\mathrm{Ag})}$.

It follows from (33) that on increasing the concentration of cadmium 
bromide in silver bromide the concentration of cation vacancies and, 
therefore, the electric conductivity will increase. Experiments carried 
out by KOCH and WAGNER 54 and TELTOW 74 confirm the above con- 
clusion. This result is of importance for the interpretation of the rate 
of bromination of silver alloys as determined by HAUFFE and
GENSCH. 31, 21

For the rate of bromination of silver and silver alloys the electron 
hole conductivity is the rate determining factor and we must, therefore, 
find a relation between the hole conductivity and the cadmium
concentration in silver and the CdBr₂ concentration in AgBr, respectively.
Applying the law of mass action to (19) at constant bromine vapour
pressure

\[ c_{\mathrm{Ag}\square'} \cdot c_{\oplus} = K \quad (\text{for } p_{\mathrm{Br_2}} = \text{const}) \qquad (34) \]

Observing the relation 



Is' 



(35) 



Fig. 10. The dependence of the ratio of the rate constant of bromination
of the silver-cadmium alloy, k, to that of the pure silver, k°, on the
amount of cadmium added $N_{\mathrm{Cd}}$ (atom fraction) as measured by HAUFFE
and GENSCH (T = 330°C and $p_{\mathrm{Br_2}}$ = 170 mm Hg). Curve 1 observed
and Curve 2 calculated

and using equation (33) we finally obtain for the ratio of the oxidation
rate of silver-cadmium alloys k to that of pure silver k°






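\[ \frac{k}{k^{\circ}} = \left\{ \frac{c_{\mathrm{Cd}}}{2\,c_{\mathrm{Ag}\square'}^{\circ}} + \left[ \left( \frac{c_{\mathrm{Cd}}}{2\,c_{\mathrm{Ag}\square'}^{\circ}} \right)^{2} + 1 \right]^{1/2} \right\}^{-1} \qquad (36) \]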



Using TELTOW'S value for $c_{\mathrm{Ag}\square'}^{\circ}$ = 2.9 · 10⁻³ at 330°C the
calculated values of k agree satisfactorily with the values of k obtained
by the experiments shown in Fig. 10. Silver-lead and silver-zinc alloys
behave similarly under identical experimental conditions (see Fig. 11).
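
The dependence of the rate ratio on the cadmium content follows from the FRENKEL equilibrium (30), the electro-neutrality condition and the mass-action relation (34). The sketch below is a minimal illustration, assuming that the rate is proportional to the electron hole concentration and that the cadmium concentration in the bromide is expressed in the same units as TELTOW'S value; the function names are hypothetical.

```python
import math

C_FRENKEL_PURE = 2.9e-3  # TELTOW's value for pure AgBr at 330 C, from the text

def vacancy_concentration(c_cd, c0=C_FRENKEL_PURE):
    """Cation vacancy concentration from c_vac * c_int = c0**2 and
    c_vac = c_int + c_cd (electro-neutrality)."""
    return 0.5 * c_cd + math.sqrt(0.25 * c_cd ** 2 + c0 ** 2)

def bromination_ratio(c_cd, c0=C_FRENKEL_PURE):
    """k(alloy) / k(pure silver), taking the rate as proportional to the
    hole concentration, which varies as 1 / c_vac at constant pressure."""
    return c0 / vacancy_concentration(c_cd, c0)

for c_cd in (0.0, 0.001, 0.005, 0.01, 0.05):
    print(f"c_Cd = {c_cd:.3f}   k/k0 = {bromination_ratio(c_cd):.3f}")
```

With no cadmium the ratio is unity, and it falls steadily as the cadmium concentration becomes comparable with and then larger than the intrinsic defect concentration.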



Fig. 11. Dependence of the rate constant of bromination of the silver
alloy, $k_r$ (equ. cm⁻¹ s⁻¹), on the increasing amount of cadmium, lead
or zinc added as measured by GENSCH and HAUFFE (T = 330°C and
$p_{\mathrm{Br_2}}$ = 170 mm Hg)

According to the order of increasing solubility, ZnBr₂ < PbBr₂
< CdBr₂, we have

\[ \left( \frac{k}{k^{\circ}} \right)_{\min}^{\mathrm{Ag/Cd}} < \left( \frac{k}{k^{\circ}} \right)_{\min}^{\mathrm{Ag/Pb}} < \left( \frac{k}{k^{\circ}} \right)_{\min}^{\mathrm{Ag/Zn}} \]

Similar results were obtained by HIMMLER, 47 who found that the rate
of chlorination of silver is decreased by the presence of lead chloride
and cadmium chloride. The chance of developing silver alloys more
resistant to attack by halogens at high temperatures than silver-cadmium
alloys does not seem to be very great at present, if it is remembered
that only metals which are soluble in silver, whose ionic radius is
1.1 ± 0.2 Å and whose valency is 2 or 3 can be used. Only if the radius
falls within the above limits can a solubility in AgBr large enough to
produce a high concentration of silver ion vacancies be expected.
Employing GOLDSCHMIDT'S values for the ionic radii only the following
cations need be considered: Cd²⁺ (1.03), Pb²⁺ (1.32), Cu²⁺ (0.96),
Tl³⁺ (1.05), Sr²⁺ (1.27) and La³⁺ (1.22). The use of silver alloys
containing more than one alloying metal is not desirable, because the
more soluble metal ion diminishes the solubility of the other metal
ions in silver halides. 89 The resistance of ternary silver alloys is
thereby often reduced.




Fig. 12. Variation of the electrical conductivity of zinc oxide with
increasing amounts of trivalent and monovalent cations (mole per cent
foreign oxide) at 390°C as measured by HAUFFE and VIERK
($p_{\mathrm{air}}$ = 1 atm)

Oxidation Processes Involving Quasi-free Electron Conducting Layers

We shall now consider the influence of small amounts of aluminium and
lithium in zinc on the rate of oxidation of zinc. If we oxidize a
zinc-aluminium alloy at 390°C in an atmosphere of oxygen, a layer of zinc
oxide containing Al₂O₃ is formed. Measurements of the electric
conductivity of zinc oxide carried out by HAUFFE and VIERK 40 (see
Fig. 12) showed that with increasing amounts of Al₂O₃ the concentration
of quasi-free electrons increases while that of zinc ions in interstitial
positions decreases. The equation corresponding to the solution of Al₂O₃
in zinc oxide is

\[ \mathrm{Al_2O_3} \rightleftharpoons 2\,\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn}) + 2\,\ominus + 2\,\mathrm{ZnO} + \tfrac{1}{2}\,\mathrm{O_2^{(g)}} \qquad (37a) \]

or

\[ \tfrac{1}{2}\,\mathrm{O_2^{(g)}} + 2\,\mathrm{Zn}\bigcirc^{\cdot} + \mathrm{Al_2O_3} \rightleftharpoons 2\,\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn}) + 4\,\mathrm{ZnO}. \qquad (37b) \]

Fig. 13 shows aluminium ions in zinc ion positions. Because of
electro-neutrality quasi-free electrons will appear while interstitial
zinc ions will disappear; for the same reason we have

\[ c_{\ominus} = c_{\mathrm{Zn}\bigcirc^{\cdot}} + c_{\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn})}. \qquad (38) \]

Thus, using equation (38) and substituting in equation (22) at constant
oxygen pressure, we obtain for the ratio of the rate of oxidation constants
of the Zn/Al alloy k to that of the pure zinc k°

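\[ \frac{k}{k^{\circ}} = \frac{1}{\dfrac{c_{\mathrm{Al}}}{2\,c_{\mathrm{Zn}\bigcirc^{\cdot}}^{\circ}} + \left[ \left( \dfrac{c_{\mathrm{Al}}}{2\,c_{\mathrm{Zn}\bigcirc^{\cdot}}^{\circ}} \right)^{2} + 1 \right]^{1/2}} \qquad (39) \]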



where $c_{\mathrm{Al}} = c_{\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn})}$ is the concentration of
aluminium ions at zinc ion lattice points.

Measurements of the electric conductivity of mixtures of zinc oxide
and aluminium oxide at 400°C showed that the concentration of
interstitial zinc ions in pure ZnO can be estimated as
$c_{\mathrm{Zn}\bigcirc^{\cdot}}^{\circ} \approx 4 \cdot 10^{-5}$. 40 It is
easy to show, therefore, that if the addition amounts to more than
0.1 atom per cent, equation (39) becomes

\[ \frac{k}{k^{\circ}} = \frac{c_{\mathrm{Zn}\bigcirc^{\cdot}}^{\circ}}{c_{\mathrm{Al}}} \qquad (40) \]

Fig. 13. Lattice defects in a mixed crystal of ZnO-Al₂O₃

Fig. 14. Lattice defects in a mixed crystal of ZnO-Li₂O

We can treat the oxidation of a zinc-lithium alloy in the same way. 
For zinc oxide containing lithium oxide we have 



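\[ 2\,\ominus + \mathrm{Li_2O} + \tfrac{1}{2}\,\mathrm{O_2^{(g)}} \rightleftharpoons 2\,\mathrm{Li}\bullet'(\mathrm{Zn}) + 2\,\mathrm{ZnO}. \qquad (41) \]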



Assuming the validity of equation (41) we see that adding lithium oxide 
to zinc oxide causes an increase in the concentration of interstitial zinc 
ions and a decrease in the concentration of quasi-free electrons (see 
Fig. 14). 

Furthermore, the condition of electro-neutrality requires 

\[ c_{\mathrm{Zn}\bigcirc^{\cdot}} = c_{\ominus} + c_{\mathrm{Li}\bullet'(\mathrm{Zn})}. \qquad (42) \]

Thus, combining equation (42) and the equilibrium condition at constant
oxygen pressure, equation (22), we finally obtain for the ratio of
the oxidation rate constant of the Zn/Li alloy k to that of pure zinc k°



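\[ \frac{k}{k^{\circ}} = \frac{c_{\mathrm{Li}}}{2\,c_{\mathrm{Zn}\bigcirc^{\cdot}}^{\circ}} + \left[ \left( \frac{c_{\mathrm{Li}}}{2\,c_{\mathrm{Zn}\bigcirc^{\cdot}}^{\circ}} \right)^{2} + 1 \right]^{1/2} \qquad (43) \]

and if $c_{\mathrm{Li}} = c_{\mathrm{Li}\bullet'(\mathrm{Zn})} \gg c_{\mathrm{Zn}\bigcirc^{\cdot}}^{\circ}$

\[ \frac{k}{k^{\circ}} = \frac{c_{\mathrm{Li}}}{c_{\mathrm{Zn}\bigcirc^{\cdot}}^{\circ}} \qquad (44) \]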
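
The opposite effects of trivalent and monovalent additions can be illustrated numerically. This is a sketch under stated assumptions rather than the authors' calculation: the rate is taken as proportional to the interstitial zinc concentration, the mass-action product of interstitial zinc ions and quasi-free electrons is held fixed at constant oxygen pressure, the foreign-ion concentration in the oxide is assumed equal to the atom fraction in the alloy, and the function names are hypothetical.

```python
import math

C_INTERSTITIAL_PURE = 4.0e-5  # estimate for pure ZnO quoted in the text

def ratio_trivalent(c_foreign, c0=C_INTERSTITIAL_PURE):
    """k/k0 for trivalent additions (e.g. Al): c_e = c_int + c_foreign and
    c_int * c_e = c0**2, written in a form that avoids cancellation."""
    half = 0.5 * c_foreign
    return c0 / (half + math.sqrt(half ** 2 + c0 ** 2))

def ratio_monovalent(c_foreign, c0=C_INTERSTITIAL_PURE):
    """k/k0 for monovalent additions (e.g. Li): c_int = c_e + c_foreign and
    c_int * c_e = c0**2."""
    half = 0.5 * c_foreign
    return (half + math.sqrt(half ** 2 + c0 ** 2)) / c0

print(ratio_trivalent(1.0e-3))   # 0.1 atom per cent Al, as an atom fraction
print(ratio_monovalent(4.0e-3))  # 0.4 atom per cent Li, as an atom fraction
```

For additions well above the intrinsic defect concentration the two functions approach the limiting forms in which the ratio is inversely or directly proportional to the concentration of the foreign ion.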



The experimental results obtained by GENSCH and HAUFFE 20 confirm
the validity of the assumption made (see Fig. 15). Table 2 shows the
observed and the calculated values for the oxidation rate constants of
spectroscopically pure zinc and for zinc alloys.

TABLE 2

Calculated and Measured Rate Constants of the Formation
of Oxide on Zinc and Zinc Alloys at 390°C, as measured by
GENSCH and HAUFFE ($p_{\mathrm{O_2}}$ = 1 atm)

Zinc alloys in         Rate Constant of Oxidation k (g² cm⁻⁴ h⁻¹)
atom per cent          Observed               Calculated

zinc, pure             0.8 . 10⁻⁹             -
zinc + 0.1 Al          1.0 . 10⁻¹[?]          4 . 10⁻¹¹
     + 1.0 Al          < 1.0 . 10⁻¹[?]        4 . 10⁻¹²
zinc + 0.4 Li          2.0 . 10⁻[?]           1 . 10⁻[?]
zinc + 0.1 Tl          2.0 . 10⁻[?]           -
     + 0.5 Tl          2.0 . 10⁻[?]           -
     + 1.0 Tl          1.3 . 10⁻[?]           -

Fig. 15. Influence of foreign metals on the rate of oxidation of zinc,
as measured by GENSCH and HAUFFE ($p_{\mathrm{O_2}}$ = 1 atm and T = 390°C)

The curious dependence of the rate of oxidation of the zinc-thallium
alloys on the thallium content of the alloy is remarkable. As shown by
Fig. 15 the rate of oxidation of an alloy containing 0.1 atom per cent Tl
is greater than that of pure zinc, whereas a greater thallium content
(0.5 and 1.0 atom per cent) leads to a rate of oxidation which is lower
than that of pure zinc. This behaviour can only be understood if it is
tentatively assumed that small additions of thallium enter the zinc oxide
lattice as univalent cations whereas greater quantities enter as
trivalent cations.




Fig. 16. Dependence of the electrical conductivity of TiO₂ on the
increasing amounts of WO₃ ($N_{\mathrm{WO_3}}$ mole fraction) added
($p_{\mathrm{air}}$ = 1 atm). According to HAUFFE, GRUNEWALD and GREESE



Since the common impurities of zinc are aluminium, iron and gal- 
lium, but not alkali metals, oxidation produces zinc oxide layers con- 
taining trivalent cations and giving a lower rate of oxidation than 
observed for pure zinc. It is, therefore, permissible to assume that the 
high value of the rate of oxidation of zinc mentioned above is a
direct indication of the purity of the zinc used. 

From the theoretical viewpoint, and according to conductivity 
measurements on mixed crystals of TiO₂ and WO₃ recently carried out
by HAUFFE, GRUNEWALD and GREESE 34 (cf. Fig. 16), the rate of oxida- 
tion of titanium may be expected to diminish in the presence of tungsten, 
which decreases the concentration of oxygen ion vacancies. An investi- 
gation of the oxidation of titanium alloys from this special point of 
view has not yet been made. The dissolution of tungsten oxide in 
titanium oxide can be formulated as follows : 



W0 



20 



W0 







TiO 2 

+ Ti0 



i0 2 (g) . 



(45) 
(45a)

According to the tentatively proposed equilibrium,

\[ \mathrm{TiO_2} \rightleftharpoons \mathrm{O}\square^{\cdot} + \ominus + \tfrac{1}{2}\,\mathrm{O_2^{(g)}}, \qquad (46) \]

the rate of oxidation of titanium and titanium alloys, in contrast to the
oxidation of zinc and its alloys, seems to be determined by the diffusion
of oxygen ions. HAUFFE and PFEIFFER 39 show that the rate of
oxidation of titanium decreases in the presence of WO₃ vapour in
oxygen (Fig. 17).

Fig. 17. Decrease of the rate of oxidation of titanium due to the
presence of WO₃ (curve II) in oxygen of 1 atm at 800°C as
measured by HAUFFE and PFEIFFER

Oxidation Processes Involving Electron-hole-conducting Layers

The rate of oxidation of a metal covered by an electron-hole-conducting
oxide layer can be increased or decreased by addition of a foreign
metal whose oxide dissolves in the mother oxide and which gives
monovalent or trivalent cations respectively. The cause of this
behaviour can be understood by applying the theory of lattice defects
to mixed oxide crystals. Fig. 18 shows schematically what happens
when monovalent ions, i.e. Li⁺, enter the crystal lattice of nickel oxide
which is of the hole conducting type. According to the equation



\[ \tfrac{1}{2}\,\mathrm{O_2^{(g)}} + \mathrm{Li_2O} \rightleftharpoons 2\,\mathrm{Li}\bullet'(\mathrm{Ni}) + 2\,\oplus + 2\,\mathrm{NiO} \qquad (47) \]

where $\mathrm{Li}\bullet'(\mathrm{Ni})$ represents a lithium ion in a nickel position
and the ' the negative charge of this "impurity centre," we obtain an
increase of the electron hole concentration with an increasing
concentration of lithium oxide, and, therefore, an increase in the
electrical conductivity. The conductivity measurements of VERWEY and
associates 7 have confirmed the presence of this type of defect
(see Fig. 19).

Fig. 18. Lattice defects in a mixed crystal of NiO-Li₂O

Fig. 19. Dependence of the electrical conductivity of NiO on the
increasing amount of Li₂O added, as measured by VERWEY, HAAYMAN
and ROMEYN at 20°C

In analogy to equation (4) the equilibrium condition for lattice 
defects in nickel oxide in an atmosphere of oxygen reads 

\[ \tfrac{1}{2}\,\mathrm{O_2^{(g)}} \rightleftharpoons \mathrm{NiO} + \mathrm{Ni}\square'' + 2\,\oplus. \qquad (48) \]

Applying the ideal law of mass action, we get

\[ c_{\mathrm{Ni}\square''} \cdot c_{\oplus}^{2} = K \quad (\text{if } p_{\mathrm{O_2}} = \text{const}) \qquad (49) \]

where $c_{\mathrm{Ni}\square''}$ represents the concentration of nickel ion
vacancies. Furthermore, electroneutrality leads to

\[ c_{\oplus} = 2\,c_{\mathrm{Ni}\square''} + c_{\mathrm{Li}\bullet'(\mathrm{Ni})}. \qquad (50) \]

If $c_{\mathrm{Li}} = c_{\mathrm{Li}\bullet'(\mathrm{Ni})} \gg c_{\mathrm{Ni}\square''}$ and if we
substitute silver ions for lithium ions, then the ratio of the oxidation
rate constant of a nickel-silver alloy k to that of pure nickel k°,
calculated from equations (49) and (50), equals



The decrease in the rate of oxidation due to the presence of silver in 
nickel is far less than expected from equation (51). The cause seems to 
be the small solubility of silver oxide in nickel oxide, as observed even 
at 1000°C by HAUFFE and PFEIFFER. 41 
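
The argument behind equations (49) to (51) can be illustrated by a small numerical sketch. It assumes, as an interpretation and not as the author's stated formula, that the rate constant is proportional to the concentration of nickel ion vacancies, so that k/k° equals the ratio of vacancy concentrations in the doped and the pure oxide. The function names, the bisection solver and the choice of units (vacancy concentration of pure NiO set to 1) are illustrative only.

```python
def vacancy_concentration_monovalent(c_dopant, c0=1.0):
    """Solve c_v * (2*c_v + c_dopant)**2 = K for the vacancy concentration c_v.

    Mass action (49):          c_v * c_holes**2 = K
    Electroneutrality (50):    c_holes = 2*c_v + c_dopant
    K is fixed by the pure oxide, where c_holes = 2*c0, i.e. K = 4*c0**3.
    """
    K = c0 * (2.0 * c0) ** 2
    lo, hi = 0.0, c0  # monovalent doping can only lower c_v, so the root is in [0, c0]
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mid * (2.0 * mid + c_dopant) ** 2 < K:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def rate_ratio_monovalent(c_dopant, c0=1.0):
    """k/k0, ASSUMING the rate constant is proportional to c_v."""
    return vacancy_concentration_monovalent(c_dopant, c0) / c0


if __name__ == "__main__":
    for c in (0.0, 1.0, 10.0, 100.0):
        print(f"c_dopant/c0 = {c:6.1f}   k/k0 = {rate_ratio_monovalent(c):.3e}")
```

Under these assumptions the ratio tends to (2c°/c_dopant)² when the dopant concentration greatly exceeds the vacancy concentration of the pure oxide, which is the strong decrease in oxidation rate referred to in the text.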

Making use of the remarkably great solubility of lithium oxide in 
nickel oxide and the fact that at 1100°C the vapour pressure of lithium 
oxide is satisfactorily high, HAUFFE and PFEIFFER 39 investigated the 
attack of oxygen on nickel at 1000 and 1100°C in the presence of 
Li₂O vapour. According to equations (47), (50) and (51a) 
we obtain a decrease in the rate of oxidation of nickel (see Fig. 19). 
A quantitative evaluation is not yet possible. 

Fig. 20. Dependence of the electrical conductivity of NiO on the 
increasing amount of Cr₂O₃ added, according to HAUFFE and BLOCK 

On the other hand, if chromium oxide is dissolved in nickel oxide the 
electrical conductivity is lowered considerably (see Fig. 20). 38 
According to WAGNER'S scheme shown in Fig. 21, the presence of trivalent 
chromium ions leads to an increase in the concentration of nickel ion 
vacancies and to a decrease in the concentration of electron holes. 
The equation for the solution of Cr₂O₃ in nickel oxide then reads 

\[ 2\,\oplus + \mathrm{Cr_2O_3} \rightleftharpoons 2\,\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni}) + 2\,\mathrm{NiO} + \tfrac{1}{2}\,\mathrm{O_2}(g) \qquad (52a) \]

or 

\[ \mathrm{Cr_2O_3} \rightleftharpoons 2\,\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni}) + \mathrm{Ni}\square'' + 3\,\mathrm{NiO} \qquad (52b) \]

Here the symbol $\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})$ represents a chromium ion in a nickel ion site. 

Fig. 21. Lattice defects in a mixed NiO-Cr₂O₃ crystal 
as proposed by WAGNER 

Remembering that the condition of electroneutrality holds in the 
NiO-Cr₂O₃ phase we obtain 

\[ c_{\mathrm{Ni}\square''} = \tfrac{1}{2}\,c_{\oplus} + \tfrac{1}{2}\,c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})} \qquad (53) \]

and, approximating, 

\[ c_{\mathrm{Ni}\square''} \approx \tfrac{1}{2}\,c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})} \quad (\text{if } c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})} \gg c^{\circ}_{\mathrm{Ni}\square''}) \qquad (54) \]

The ratio of the oxidation rate constant of a nickel-chromium alloy, k, to 
that of pure nickel, k°, is 

\[ \frac{k}{k^{\circ}} \approx \frac{c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})}}{2\,c^{\circ}_{\mathrm{Ni}\square''}} \quad (\text{if } c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})} \gg c^{\circ}_{\mathrm{Ni}\square''}) \qquad (55) \]
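
The limiting behaviour expressed by equations (53) to (55) can be checked numerically. The sketch below assumes that the mass-action condition (49) of the pure oxide remains valid in the chromium-containing oxide and that the rate constant is proportional to the vacancy concentration; both are assumptions made for illustration. The function names and the unit choice (vacancy concentration of pure NiO set to 1) are hypothetical.

```python
def vacancy_concentration_trivalent(c_cr, c0=1.0):
    """Solve c_v * (2*c_v - c_cr)**2 = K for the vacancy concentration c_v.

    Electroneutrality (53):  2*c_v = c_holes + c_cr
    Mass action (49):        c_v * c_holes**2 = K, with K = 4*c0**3 (pure oxide)
    """
    K = c0 * (2.0 * c0) ** 2
    # c_holes must stay non-negative, so c_v >= c_cr/2; at c_v = c_cr/2 + c0
    # the left-hand side is already >= K, which brackets the root.
    lo, hi = 0.5 * c_cr, 0.5 * c_cr + c0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        holes = 2.0 * mid - c_cr
        if mid * holes ** 2 < K:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def rate_ratio_trivalent(c_cr, c0=1.0):
    """k/k0, ASSUMING the rate constant is proportional to c_v."""
    return vacancy_concentration_trivalent(c_cr, c0) / c0


if __name__ == "__main__":
    for c in (0.0, 1.0, 10.0, 100.0):
        print(f"c_Cr/c0 = {c:6.1f}   k/k0 = {rate_ratio_trivalent(c):8.3f}"
              f"   c_Cr/(2*c0) = {0.5 * c:8.3f}")
```

For chromium concentrations much larger than the vacancy concentration of the pure oxide the computed ratio approaches c_Cr/(2c°), in line with equation (55).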

Although there had been many empirical investigations on the oxida- 
tion of nickel alloys, the first experiments with the theory of lattice 
defects in view were carried out by WAGNER and ZIMENS. 83 In agree- 
ment with the requirements of equation (55), Table 3 shows that the 
rate of oxidation of nickel increases with increasing amounts of chromium. 
The relatively rapid decrease in the rate of oxidation at higher concen- 
trations of chromium (> 6 atom per cent Cr) is probably due to the 
formation of a compact layer of the spinel phase NiCr₂O₄, in which 
ionic diffusion processes occur extremely slowly. 

TABLE 3 

The rate of oxidation of nickel-chromium alloys at 1000°C 
as measured by WAGNER and ZIMENS 

Weight per cent Cr   t (sec)   Δm/A (g cm⁻²)   k (g² cm⁻⁴ s⁻¹)
0                    57600     4.1 × 10⁻³      3.1 × 10⁻¹⁰
0.3                  57600     8.8 × 10⁻³      14.0 × 10⁻¹⁰
1.0                  57600     12.3 × 10⁻³     26.0 × 10⁻¹⁰
3.0                  57600     13.4 × 10⁻³     31.0 × 10⁻¹⁰
10.0                 57600     2.9 × 10⁻³      1.5 × 10⁻¹⁰
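
The tabulated constants appear to be consistent with the parabolic relation (Δm/A)² = k·t. This is an inference from the numbers and is not stated explicitly in the text; the sketch below simply recomputes k from the mass gain and the time, with the table values transcribed by hand and units of g cm⁻² and g² cm⁻⁴ s⁻¹ assumed.

```python
# (weight % Cr, time in s, mass gain per unit area, tabulated rate constant)
TABLE_3 = [
    (0.0, 57600, 4.1e-3, 3.1e-10),
    (0.3, 57600, 8.8e-3, 14.0e-10),
    (1.0, 57600, 12.3e-3, 26.0e-10),
    (3.0, 57600, 13.4e-3, 31.0e-10),
    (10.0, 57600, 2.9e-3, 1.5e-10),
]


def parabolic_constant(mass_gain, time_s):
    """Rate constant k from (mass gain)^2 = k * t (assumed parabolic law)."""
    return mass_gain ** 2 / time_s


if __name__ == "__main__":
    for wt_cr, t, dm, k_tab in TABLE_3:
        k_calc = parabolic_constant(dm, t)
        print(f"{wt_cr:5.1f} % Cr   k recomputed = {k_calc:.2e}   tabulated = {k_tab:.2e}")
```

The recomputed values agree with the tabulated ones to within a few per cent, which supports reading the last column as a parabolic rate constant.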



As was pointed out by WAGNER and ZIMENS the quantitative dis- 
cussion of the experimental results is rather difficult, because we must 
assume that nickel and chromium ions do not possess the same mobilities, 
so that we are here dealing with a solid solution whose composition is a 
function of the distance from the oxide/metal interface. If the mobility 
of the chromium ion is small, the region near the oxide/metal interface 
will be rich in chromium oxide. Conversely, if the mobility of the 
chromium ion is the larger of the two, the concentration of chromium 
oxide near the outer surface will be above the average concentration. 
At temperatures above 1000°C no accumulation of Cr₂O₃ near the 
outer surface is in fact observed, but this may be because Cr₂O₃ is 
rather volatile. Concerning this problem an electron diffraction and 
electron microscope study carried out by GULBRANSEN, PHELPS and 
HICKMAN 25 is of importance. The quantitative treatment of the 
different mobilities of the divalent nickel ions and trivalent chromium 
ions opens up new possibilities of understanding many of the rather 
complicated oxidation processes of alloys. 

The oxide layer formed on nickel-chromium alloys with 10-20 per 
cent Cr is composed either of a spinel (> 1000°C) 7a, 60 or of Cr₂O₃ 
(< 1000°C) 25, 56. In addition the greater mobility of chromium ions in 
spinel and NiO-Cr₂O₃ layers observed by HAUFFE and PSCHERA 44 seems 
to be of importance for the formation of oxide layers. In the case of the 
oxidation of iron-aluminium alloys an increase of Al₂O₃ in the oxide 
layer was found by KORNILOW and SCHPIKELMANN. 65 An extensive 
discussion of this matter was given by HAUFFE. 36 

THE FORMATION OF MULTI-LAYERED SCALE CONTAINING 
VARIOUS METAL OXIDE PHASES 

One of the important factors determining the resistance of a metal or 
alloy to oxidation at high temperatures is the structure of the oxide 
layer formed at the surface. An extensive investigation of structure 
has been carried out by GULBRANSEN and associates. 22-26 Applying the 
electron diffraction reflection technique they could observe the structure 
of the oxide layer in situ at temperatures at which it is actually protec- 
tive. The results of these measurements are of particular importance 
for the class of oxidation processes in which, from a thermodynamic 
point of view, two or three oxide layers can be expected. 

One of the most thoroughly investigated processes is the oxidation of 
iron at different temperatures and pressures of oxygen. The first 
relatively complete studies of the oxidation of iron at elevated and low 
temperatures were carried out by JACKSON and QUARRELL 48 and 
FISCHBECK 16 and NELSON. 66 They recognized that the formation of different 
oxides and their structural transformations are functions of time as 
well as temperature. They found that below 400°C the stable oxide on 
iron was Fe₃O₄, and that FeO was formed between 515 and 900°C. 
Between 400 and 515°C, FeO, Fe₃O₄ or a mixture of the two oxides 
was found. JIMORI 50 could distinguish between the cubic Fe₂O₃ and 
Fe₃O₄ by means of x-ray analysis; by electron diffraction this would 
have been impossible. From x-ray evidence JIMORI postulated a 
transition of γ-Fe₂O₃ with cubic spinel structure to α-Fe₂O₃ with 
rhombohedral structure at 340°C. GULBRANSEN and HICKMAN found a 
lower transition point at about 225°C. For long periods of oxidation at 
up to 400°C they observed the formation of α-Fe₂O₃. Alongside this 
evidence we should note that for thick films formed at 500°C the 
inverse spinel Fe₃O₄ is the only oxide observed. Finally, we have studies 
by BÉNARD and associates 4 of the influence of temperature on the 
kinetics of formation of different iron oxide layers (see Fig. 22). 

Fig. 22. Influence of temperature on the kinetics of formation of 
different iron oxide layers, as measured by BÉNARD and COQUELLE. 
(N = mole fraction of FeO, Fe₃O₄ and Fe₂O₃, respectively) 

Thus among all these results there seem to be many disagreements; they 
are due to difficulties which may be listed as difficulties of 

(a) reproducibility, 

(b) accurate measurement of the surface temperature, 

(c) accurate determination of the time of oxidation, 

(d) controlling the film thickness, and 

(e) controlling the pressure of oxygen. 

Recently BIRCHENALL and associates 83 have investigated the 
mechanism and the kinetics of the scaling of iron under the pressure of 
1 atm of oxygen between 400 and 1200°C. The results indicate that in 
no case, above a temperature of about 625°C, does the layer contain 
more than 10 per cent by volume of the higher oxides Fe₃O₄ and Fe₂O₃ 
(see Table 4). 

TABLE 4 

Approximate percentages of FeO, Fe₃O₄ and Fe₂O₃ in 
scale formed on iron oxidized in oxygen; according to 
DAVIES, SIMNAD and BIRCHENALL 

T (°C)   t (min)   FeO (pct)   Fe₃O₄ (pct)   Fe₂O₃ (pct)
600      100       50          50            -
625      110       70          30            -
700      225       98          2             -
750      100       96          4             -
800      100       96          2             2
900      100       90          9             < 1
1000     25        95          4             < 1
1090     25        94          5             < 1

To measure the rate of growth of Fe₃O₄ and Fe₂O₃, 
samples of FeO and Fe₃O₄ were prepared and oxidized in oxygen. The 
values for the parabolic rate constants are given in Table 5. 

TABLE 5 

Values for the Parabolic Rate Constants k (g cm⁻² min^(-1/2)) 
of the Oxidation of Iron in Oxygen; from DAVIES, 
SIMNAD and BIRCHENALL 

T (°C)   k              Remarks
400      3.4 × 10⁻⁸     Oxidation of Fe to form multilayer scales
500      5.0 × 10⁻⁵     Oxidation of Fe to form multilayer scales
600      9.0 × 10⁻⁵     Oxidation of Fe to form multilayer scales
700      6.0 × 10⁻⁴     Oxidation of Fe to form multilayer scales
800      1.85 × 10⁻³    Oxidation of Fe to form multilayer scales
900      3.9 × 10⁻³     Oxidation of Fe to form multilayer scales
1035     9.0 × 10⁻³     Oxidation of Fe to form multilayer scales
1130     1.24 × 10⁻²    Oxidation of Fe to form multilayer scales
1200     1.87 × 10⁻²    Oxidation of Fe to form multilayer scales
850      2.3 × 10⁻⁴     Oxidation of FeO
910      3.3 × 10⁻⁴     Oxidation of FeO
930      3.9 × 10⁻⁴     Oxidation of FeO
950      4.9 × 10⁻⁴     Oxidation of FeO
800      4.5 × 10⁻⁵     Oxidation of Fe₃O₄
900      1.5 × 10⁻⁴     Oxidation of Fe₃O₄
988      3.0 × 10⁻⁴     Oxidation of Fe₃O₄

Below 570°C the oxide layer formed is almost pure Fe₃O₄. The reactions 
involved, neglecting the formation of Fe₂O₃, are: 

\[ 3\,\mathrm{Fe} + 2\,\mathrm{O_2} \rightarrow \mathrm{Fe_3O_4} \qquad (56a) \]

\[ 3\,\mathrm{FeO} + \tfrac{1}{2}\,\mathrm{O_2} \rightarrow \mathrm{Fe_3O_4} \qquad (56b) \]

BIRCHENALL showed that, for a given thickness of Fe₃O₄, the increase 
in weight for process (56a) is four times as great as for process (56b). 
Now BIRCHENALL and associates tentatively suggest that the growth of 
each layer is a diffusion-controlled process and that the interface reaction 
rates may be neglected. Applying a modified Pfeil technique using 
radioactive silver Ag¹¹⁰ as inert markers they find a very interesting 
diagram of the oxidation process of iron (see Fig. 23). Whereas for the 
formation of FeO the rate-determining step is the migration of Fe²⁺ 
ions, in hematite it is not the iron ions that wander but the oxygen ions. 
In magnetite, however, both ions migrate. These results are consistent 
with the semiconducting properties of the oxides of iron. 
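
The factor of four quoted for processes (56a) and (56b) follows directly from the stoichiometry: per formula unit of Fe₃O₄ formed, reaction (56a) takes up two molecules of O₂ and reaction (56b) only half a molecule. A minimal check, in which the function name and the rounded atomic mass of oxygen are illustrative choices:

```python
O_ATOMIC_MASS = 15.999  # g per mole of oxygen atoms


def oxygen_gain_per_mole_fe3o4(moles_o2_consumed):
    """Mass of oxygen (g) taken up per mole of Fe3O4 formed."""
    return moles_o2_consumed * 2.0 * O_ATOMIC_MASS


gain_from_iron = oxygen_gain_per_mole_fe3o4(2.0)     # 3 Fe + 2 O2 -> Fe3O4, (56a)
gain_from_wustite = oxygen_gain_per_mole_fe3o4(0.5)  # 3 FeO + 1/2 O2 -> Fe3O4, (56b)

if __name__ == "__main__":
    print(f"weight gain ratio (56a)/(56b) = {gain_from_iron / gain_from_wustite:.1f}")
```

Since the same amount of Fe₃O₄ corresponds to the same layer thickness, the ratio of the weight gains is independent of the thickness considered.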

Fig. 23. Schematic diagram of the diffusion of ions across the different 
oxide layers during the oxidation of iron on the basis of the experimental 
results of DAVIES, SIMNAD and BIRCHENALL 

On oxidizing copper at 1000°C at an oxygen pressure of about 
100 mm Hg, an outer and an inner layer of CuO and Cu₂O, respectively, 
are formed. Just as the molar volume of Cu₂O is greater than that of 
metallic copper, so the molar volume of CuO is greater than that of 
Cu₂O (1 g-atom of Cu = 7.1 cm³; ½ mole of Cu₂O = 11.7 cm³; 
1 mole of CuO = 12.3 cm³). The cupric oxide layer on the cuprous 
oxide layer is, therefore, compact. Under these conditions PILLING 
and BEDWORTH, 68 and also FEITKNECHT 15 and FROEHLICH 19 found that 
the rate of oxidation of copper is independent of the oxygen pressure. 
This experimental observation is easily understood. The excess of 
oxygen in the Cu₂O phase at the Cu₂O/CuO interface is determined by 
the two-phase equilibrium Cu₂O + CuO, as is also, therefore, the 
concentration gradient of copper ion vacancies in the cuprous oxide 
layer. On the basis of the rate of oxidation of copper to Cu₂O, and of 
Cu₂O to CuO, WAGNER 88 suggested that the ratio by weight of the 
two oxides formed (at 1000°C and an oxygen pressure of 1 atm) is 
given by: 

\[ \frac{\Delta m_{\mathrm{CuO}}}{\Delta m_{\mathrm{Cu_2O}}} = \frac{k_{\mathrm{CuO}}}{k_{\mathrm{Cu_2O}}} = \frac{2.5 \cdot 10^{-12}}{6.6 \cdot 10^{-9}} = 0.4 \cdot 10^{-3} \]

i.e. under the above experimental conditions practically pure cuprous 
oxide is formed, only the outer surface, representing 0.1 per cent of the 
total thickness of the film, consisting of cupric oxide. It can be assumed 
that the sulphuration of iron at 670°C and a sulphur vapour pressure of 
1 atm gives rise to an iron sulphide layer of analogous composition. Here 
the rate of sulphuration of iron to FeS is a thousand times faster than 
that of FeS to FeS₂, as was recently shown by HAUFFE and RAHMEL 37 
(k(FeS) = 3.3 · 10⁻⁸ and k(FeS₂) = 1.4 · 10⁻¹¹ (equiv. cm⁻¹ s⁻¹)). A great 
number of metals and alloys have been investigated from the same point 
of view, in particular by GULBRANSEN and associates. 22-26 
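
The two numerical arguments of this paragraph, the volume comparison that makes the outer layer compact and the ratio of rate constants that fixes the proportion of the two sulphides, can be retraced with a few lines. The figures are transcribed from the text (the exponent of the FeS₂ constant is read as −11); the function name is hypothetical.

```python
# Volumes per g-atom of copper in cm^3, as quoted in the text
V_CU = 7.1          # metallic copper
V_HALF_CU2O = 11.7  # half a mole of Cu2O
V_CUO = 12.3        # one mole of CuO

# Rate constants of sulphuration in equiv. cm^-1 s^-1, as quoted in the text
K_FES = 3.3e-8
K_FES2 = 1.4e-11


def volume_ratio(v_product, v_parent):
    """Volume of the product per g-atom of metal relative to the phase it grows on."""
    return v_product / v_parent


if __name__ == "__main__":
    print(f"Cu2O on Cu : {volume_ratio(V_HALF_CU2O, V_CU):.2f}")
    print(f"CuO on Cu2O: {volume_ratio(V_CUO, V_HALF_CU2O):.2f}")
    print(f"k(FeS)/k(FeS2) = {K_FES / K_FES2:.0f}")
```

Both volume ratios exceed unity, which is the condition for a compact layer used in the text, and the ratio of the sulphuration constants is of the order of a few thousand.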





THE FORMATION OF THIN COMPACT OXIDE FILMS ON METALS 

Besides WAGNER'S theory, MOTT and CABRERA 5 have recently put 
forward another theory of the oxidation of metals. WAGNER'S formula 
is valid only if the oxide layer is so thick that the concentrations of 
interstitial ions (or ion vacancies) and free electrons (or electron holes) 
are equal to one another throughout most of its thickness. MOTT and 
CABRERA derive a formula especially for the rate of oxidation in thin 
and very thin oxide films with a thickness between 10 and 1000 Å. 
Applying the fundamental assumptions of the SCHOTTKY-MOTT 
boundary layer theory 72, 64 (Randschicht-Theorie) of crystal rectifiers, 
MOTT 65 was the first to show that in the vicinity of the metal/oxide 
interface there is a diffuse electrical double layer because the electrons 
move more quickly to the adsorbed oxygen atoms than the ions. This 
process plays an important role if the oxide film is thinner than 100 Å. 
In such a thin film the electrical field strength is so high that the rate of 
migration of ions is no longer proportional to the field strength (i.e. it 
is no longer inversely proportional to the thickness of the oxide film) 
but depends quasi-exponentially upon the thickness. 

MOTT and CABRERA derived theoretically the different conditions leading 
to the logarithmic law, the parabolic law (x² = 2kt), the cubic law 
(x³ = 3kt, experimentally confirmed by CAMPBELL and THOMAS 6 for 
the oxidation of copper), and lastly the linear law (x = kt, experi- 
mentally confirmed by GULBRANSEN and WYSONG 28 for the oxidation of 
aluminium). WAGNER 90 recently pointed out that the possibilities are 
not yet exhausted. Whereas MOTT'S theory of oxidation processes 
assumes one double layer at the oxide/metal interface, WAGNER expects 
during the growth of an oxide film an overlapping of two double layers, 
the one at the metal/oxide interface and the other at the oxide/oxygen 
interface. Thus another interpretation of the logarithmic law seems 
possible, especially for films with a thickness greater than 100 Å. 
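
The parabolic, cubic and linear laws named above differ only in the power of the thickness that grows linearly with time. A minimal sketch of the three laws as written in the text (x² = 2kt, x³ = 3kt, x = kt) follows; the function names and the sample values of k and t are illustrative, and the logarithmic law is left out because its explicit form is not given here.

```python
def thickness_parabolic(k, t):
    """Film thickness from x**2 = 2*k*t."""
    return (2.0 * k * t) ** 0.5


def thickness_cubic(k, t):
    """Film thickness from x**3 = 3*k*t."""
    return (3.0 * k * t) ** (1.0 / 3.0)


def thickness_linear(k, t):
    """Film thickness from x = k*t."""
    return k * t


if __name__ == "__main__":
    k = 1.0  # hypothetical rate constant in arbitrary consistent units
    for t in (1.0, 10.0, 100.0):
        print(f"t = {t:6.1f}   parabolic {thickness_parabolic(k, t):7.3f}"
              f"   cubic {thickness_cubic(k, t):7.3f}"
              f"   linear {thickness_linear(k, t):7.3f}")
```

With equal numerical rate constants the linear law eventually gives the thickest film and the cubic law the thinnest, which is why the exponent of the rate law is a sensitive indicator of the protective quality of the film.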

MOORE and LEE'S experimental results 62 for the oxidation of zinc, 
which cannot be explained either by the theory of MOTT-CABRERA or 
by WAGNER'S theory, led ENGELL and HAUFFE to give an explanation 
which might be of general interest and will, therefore, be described in 
some detail. For layers of up to 1000 Å MOORE and LEE found a 
parabolic law of growth with a rate of growth that was dependent on 
the pressure of oxygen, whereas WAGNER'S theory predicts 
a rate of growth independent of the oxygen pressure. WAGNER and 
GRUENEWALD had, indeed, found for thick layers (> 1000 Å) a para- 
bolic growth with the rate of growth independent of oxygen pressure. 
Using the SCHOTTKY-SPENKE boundary-layer theory of crystal recti- 
fiers, ENGELL and HAUFFE advance an explanation of the above 
observations. 

During the formation of the oxide layer two regions must be dis- 
tinguished. As shown by Fig. 24 there is a small region A in which 
an electric field has been set up because of a depletion of quasi-free 
electrons which, owing to their relatively great mobility, have been 
captured by the chemisorbed oxygen atoms on the surface. In this 
region the migration of the interstitial zinc ions is mainly due to this 
electric field, while in the region B the motion of the interstitial zinc 
ions is mainly caused by the concentration gradient. The thickness l 
of region A they call the critical thickness. 

Fig. 24. Scheme of the distribution of quasi-free electrons and interstitial 
zinc ions in the boundary layer (region A) and in the interior of the 
semiconductor (region B). (l is the critical thickness of the boundary 
layer) 

For both regions of the oxide layer the current of quasi-free electrons 
$I_{\ominus}$ and of interstitial zinc ions $I_K$ is given by the usual equations 

\[ I_{\ominus} = D_{\ominus}\,\frac{dn_{\ominus}}{dx} - n_{\ominus}\,B_{\ominus}\,E(x) \qquad (57a) \]

\[ I_{K} = D_{K}\,\frac{dn_{K}}{dx} + \nu_{K}\,E(x)\,n_{K}\,B_{K} \qquad (57b) \]

Here B is the mobility in cm² V⁻¹ s⁻¹, $n_K$ and $n_{\ominus}$ the concentration 
of interstitial metal ions and quasi-free electrons respectively, $\nu_K$ the 
charge of the metal ions in interstitial sites and E(x) the electric field 
strength in V cm⁻¹ at the point x. Equations (57a), (57b) and the 
Boltzmann equation 

\[ \frac{n_R}{n_H} = \exp\left(-\frac{V_D\,e}{kT}\right) \qquad (58) \]

exactly determine the distribution of positive and negative charges 
in the boundary layer. Here $V_D$ is the diffusion potential, $n_R$ and $n_H$ 
the concentration of quasi-free electrons in the boundary layer and in 
the interior of the oxide layer. 

The system is stationary if the concentration of interstitial cations 
in the boundary layer is such that the divergence of the current of 
cations moving across the whole layer (A and B) is zero. Since the 
electric field may be neglected in the interior of the oxide (E = 0 for 
x > l) we obtain, neglecting the transport of cations in the boundary 
layer by diffusion, 

\[ \left(D_K\,\frac{dn_K}{dx}\right)_{x>l} = \left(\nu_K\,n_K(x)\,B_K\,E(x)\right)_{x<l} \qquad (59) \]

With $D_K = B_K\,kT/e$ and Poisson's equation 
$\partial E/\partial x = -(4\pi\,\nu_K\,e/\varepsilon)\,n_K(x)$, 
$\varepsilon$ being the permittivity, we obtain on integrating between the limits 
x = 0 and x = l for the concentration of interstitial cations in the 
boundary layer 

\[ n_K(x) = \left\{\frac{\varepsilon\,kT}{8\pi\,\nu_K^{2}\,e^{2}}\left(\frac{dn_K}{dx}\right)_{x>l}\right\}^{1/2}(l-x)^{-1/2} \qquad (60) \]

By means of a second integration from 0 to l we finally obtain for the 
thickness of the boundary layer 

\[ l = \left\{\frac{9\,\varepsilon\,V_D^{2}}{32\pi\,kT\left(\dfrac{dn_K}{dx}\right)_{x>l}}\right\}^{1/3} \qquad (61) \]

For a critical discussion of these equations see ENGELL, HAUFFE and 
SCHOTTKY. 13 
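
Two relations used in this derivation lend themselves to a quick numerical illustration: the Boltzmann relation (58) between the electron concentrations in the boundary layer and in the interior, and the Einstein relation D = B·kT/e between diffusion coefficient and mobility. The sketch assumes singly charged carriers and a sign convention in which a positive diffusion potential depletes the boundary layer; the function names and the sample values are hypothetical.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K, so K_B_EV * T equals kT/e in volts


def depletion_ratio(v_d_volts, temp_k):
    """n_R / n_H = exp(-e*V_D / kT), ASSUMING depletion for positive V_D."""
    return math.exp(-v_d_volts / (K_B_EV * temp_k))


def einstein_diffusion_coefficient(mobility_cm2_per_vs, temp_k):
    """D = B*kT/e in cm^2/s for a mobility B given in cm^2 V^-1 s^-1."""
    return mobility_cm2_per_vs * K_B_EV * temp_k


if __name__ == "__main__":
    temp_k = 573.0  # 300 C, roughly the upper limit quoted for zinc oxide
    for v_d in (0.1, 0.3, 0.5):
        print(f"V_D = {v_d:.1f} V   n_R/n_H = {depletion_ratio(v_d, temp_k):.3e}")
    print(f"D for B = 1 cm^2/Vs: {einstein_diffusion_coefficient(1.0, temp_k):.4f} cm^2/s")
```

Even a diffusion potential of a few tenths of a volt lowers the electron concentration in the boundary layer by several orders of magnitude, which is why the positive charge of the interstitial cations dominates Poisson's equation in region A.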

Using the equation for the chemisorption of oxygen on zinc oxide 

\[ \tfrac{1}{2}\,\mathrm{O_2}(g) + 2\,\ominus^{(R)} \rightleftharpoons \mathrm{O}^{2-\,(A)} \qquad (62) \]

HAUFFE and ENGELL 42 finally obtain an expression for $n^{(A)}_{\mathrm{O}^{2-}}$ 
(equations (63) and (64), not legible in the source). 
Therefore $n^{(A)}_{\mathrm{O}^{2-}}$, the number of chemisorbed oxygen atoms, increases 
logarithmically with increasing partial pressure of oxygen. All these 
formulae are only valid if there is an "exhaustion layer" (the layer A of 
Fig. 24). They seem to be approximately true in zinc oxide up to 300°C. 

If the entire thickness of the oxide layer is less than l the rate of 
oxidation is dependent on the partial pressure of oxygen even though 
the diffusion of metal ions to the surface is rate-determining. The 
reasoning is as follows. 

At the beginning of the oxidation and for a constant partial pressure 
of oxygen, which leads to a constant diffusion potential $V_D$, the electric 
field strength $E(x_1)$ will decrease with increasing thickness $x_1$ and we 
have $E(x_1) > E(l)$ if $x_1 < l$. Now it may be assumed that the majority 
of the electrons arrive at the surface of the oxide, initially, by means of 
the tunnel effect and later, for a thicker boundary layer, by means of 
thermionic emission (cf. SCHOTTKY and MOTT-CABRERA). If their con- 
centration is n we can put $n \gg n_H$, and n is practically independent of 
the thickness of the layer $x_1$ if $x_1 < l$. Electrons reaching the surface 
must do so against the electric field. 

Let N be the maximum number of surface sites per square centi- 
metre available to chemisorbed oxygen atoms, and K be a constant. 
We obtain: 

\[ \frac{n^{(A)}}{N - n^{(A)}} = K\,p_{\mathrm{O_2}}^{1/2}\,n^{2}\,\exp\left(-\frac{2\,E(x_1)\,x_1\,e}{kT}\right) \qquad (65) \]

For small surface concentrations ($n^{(A)} \ll N$) we have 

\[ E(x_1) = \frac{4\pi e}{\varepsilon}\,2\,n^{(A)} = K'\,p_{\mathrm{O_2}}^{1/2}\,n^{2}\,\exp\left(-\frac{2\,E(x_1)\,x_1\,e}{kT}\right) \qquad (66) \]

where 

\[ K' = K\,\frac{8\pi e}{\varepsilon}\,N. \]

We finally obtain 

\[ E(x_1) = \frac{kT}{4\,x_1\,e}\left[\ln\left(p_{\mathrm{O_2}}\,n^{4}\,K'^{2}\right) - 2\ln E(x_1)\right] \qquad (67) \]

In this equation $\ln E(x_1) \ll E(x_1)$, since $E(x_1)$ is about 10⁵ V/cm. If 
there is a strong electric field the current due to diffusion may be 
neglected compared with the current due to the electric field and we 
obtain from (57b) 

\[ I_K = n_K\,B_K\,E(x_1)\,\nu_K \qquad (68) \]

Combining (57b), (67) and (68) we finally have for the current of 
cations (i.e. the number of the cations entering the surface per square 
centimetre per second), 

\[ I_K = n_K\,B_K\,\nu_K\,\frac{kT}{4\,x_1\,e}\,\ln\left(p_{\mathrm{O_2}}\,n^{4}\,K'^{2}\right) \qquad (69) \]

Comparison of (57b) and (66) shows that the rate of formation of thin 
films is proportional to the surface concentration of chemisorbed 
oxygen ($I_K \propto E(x) \propto n^{(A)}$). 
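
Equation (67) is implicit in the field strength E(x₁). Assuming it has the form E = a·(L − 2 ln E), with a = kT/(4x₁e) and L standing for the logarithmic term ln(p·n⁴·K'²), it can be solved numerically by bisection. This form is a reading of the equation, and the value of L used below is a purely hypothetical dimensionless input; the function name and the bracketing interval are likewise illustrative.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K, so K_B_EV * T equals kT/e in volts


def solve_field(log_term, x1_cm, temp_k, e_min=1e-6, e_max=1e12):
    """Solve E = a*(log_term - 2*ln(E)) for E in V/cm by bisection.

    a = kT/(4*x1*e) in V/cm; log_term is a hypothetical stand-in for
    ln(p_O2 * n**4 * K'**2) and is treated as a plain number.
    """
    a = K_B_EV * temp_k / (4.0 * x1_cm)

    def residual(e_field):
        # Monotonically increasing in E, so there is at most one root.
        return e_field - a * (log_term - 2.0 * math.log(e_field))

    lo, hi = e_min, e_max
    if residual(lo) > 0.0 or residual(hi) < 0.0:
        raise ValueError("root is not bracketed by [e_min, e_max]")
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if residual(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # 390 C and a film of 100 Angstrom (1e-6 cm); log_term = 30 is hypothetical
    field = solve_field(log_term=30.0, x1_cm=1e-6, temp_k=663.0)
    print(f"E(x1) = {field:.3e} V/cm")
```

For these inputs the solution is of the order of 10⁵ V/cm, and the term 2 ln E shifts the result only moderately, in line with the remark that ln E(x₁) is small compared with E(x₁).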

If the transport of cations to the surface determines the rate of 
oxidation of the metal, equation (69) gives the following rate law 

\[ \frac{dx}{dt} = \frac{k}{x} \qquad (70) \]

where 

\[ k = \frac{V_m}{N}\,n_K\,B_K\,\nu_K\,\frac{kT}{4e}\,\ln\left(p_{\mathrm{O_2}}\,n^{4}\,K'^{2}\right) \]

and $V_m$ and N are the molar volume of the oxide and Avogadro's 
number, respectively. Equation (70) shows that the rate constant of 
oxidation k is a logarithmic function of the partial pressure of oxygen 

\[ k = \text{const} \cdot \ln p_{\mathrm{O_2}} + \text{const}' \qquad (71) \]

The validity of equations (70) and (71) was experimentally confirmed 
for the oxidation of zinc by MOORE and LEE. As shown in Figs. 25 and 
26 the rate of oxidation follows the parabolic rate law and k is a 
logarithmic function of the partial pressure of oxygen, as is to be 
expected according to equation (71). 

Fig. 25. Oxidation of zinc at 390°C, in parabolic plot, 
according to MOORE and LEE 

Fig. 26. Logarithmic dependence of the parabolic rate constant k of 
the oxidation of zinc on the partial pressure of oxygen, according 
to ENGELL and HAUFFE, on the basis of measurements carried 
out by MOORE and LEE 

WAGNER and GRUENEWALD also investigated the influence of the 
partial pressure of oxygen on the rate of oxidation of zinc and found 
k to be independent of the oxygen pressure. This result is in good 
agreement with WAGNER'S theory of oxidation as well as with the views 
advanced in this article, for in WAGNER'S experiments the condition 
x ≫ l was fulfilled. 

This article does not aim at giving a complete review of all important 
experimental facts concerning the oxidation of metals and alloys. 
For example, there has been no reference to the interesting experiments 
on the internal oxidation of copper and silver alloys carried out by 
RHINES and associates 69, 70 and MEIJERING and RATHENAU. 59 The 
main purpose has been to stimulate the metal expert by giving him some 
of the new theoretical points of view on this matter. I would like to 
express my gratitude to DR. R. LANDSBERG for numerous stimulating 
discussions, and for the linguistic help he has given me. 

3 
GASES IN METALS 

C. R. Cupp 

THE important relationships between metals and the elements normally 
thought of as gases have been under consideration for many years. In 
earlier times studies were confined mainly to solubility, diffusion, and 
embrittlement problems and casting defects caused by gases; now more 
subtle effects on the physical and mechanical properties of the metals 
are being considered. There are several books 1-5 and reviews 6-11 dealing 
with the theme of this paper. Other reviews of a more specialized 
nature will be referred to throughout. The present paper is an attempt 
to summarize the more important experimental facts and to mention 
the significance of these as interpreted by various observers. In 
keeping with the purpose of this series, particular reference will be made 
to more recent observations, but the results of earlier work must, at the 
same time, be given due recognition. 

As in an earlier review, 9 it should be pointed out that only those 
elements will be considered which exist as gases in their uncombined 
states, at normal temperature and pressure, namely, hydrogen, oxygen, 
nitrogen, the inert gases and the halogen gases. As a supplement to the 
list of references, a number of papers dealing with such combined 
gases as sulphur dioxide, carbon monoxide, carbon dioxide, hydrogen 
sulphide, etc., will be given, but no consideration of these will be made 
here. It is felt that purely adsorption phenomena, and corrosion effects 
due to gases (including surface oxidation) are not part of the subject at 
hand. Commercial degassing practice will not be reviewed, although 
some mention will be made of its necessity in various operations. 
Experimental methods will not be described in any detail ; the reader is 
referred to the original literature for this information. This paper will 
deal with the solution and diffusion of gases in solid metals ; with the 
absorption, retention, and evolution of gases by metals; and with 
effects upon the properties of the solid metals. The consideration of the 
marked effect of gases on many of the physical properties of metals will 
be the main theme of this work. 

GAS-METAL RELATIONSHIPS 

FAST has argued that the permeability of metals to gases is related to 
the ability of a given gas to combine with a given metal, diffusion rates 
being high where compound formation is possible. RHINES has 
amended this statement to say that diffusion is appreciable in those 
gas-metal couples in which the gas is soluble in the metal to a significant 
degree. SMITH 2 has made a more pointed generalization, in the case of 
hydrogen at least, by emphasizing that increased permeability of the 
metal, however produced, is always accompanied by an increase in the 
"occlusive capacity" of the metal for hydrogen. At any rate if one 
considers that the halogen gases, although they react actively with 
practically all metals, have been found not to diffuse to any marked 
degree through any metals, or to dissolve in appreciable amounts in 
solid metals, then it must be assumed that FAST'S statement is not 
correct. 

Inert Gases 

Turning to the role of inert gases, there can be no compound formation 
and the possibility of solubility is questionable. It has been reported 
recently 12 that the amount of argon absorbed by uranium at room 
temperature is less than 1 part in 3 × 10⁵ by weight and at 600°C, 
less than 1 part in 3 × 10⁴, which is the magnitude of the uncertainties 
of measurements of the gas volume. Elsewhere 13 it is indicated that 
inert gases are not dissolved to any measurable extent in solid metals. 
Similarly the evidence of diffusion of inert gases through metals is not 
well confirmed. The earliest report found in the literature 14 showed that 
there is no permeability of metals to inert gases. From preliminary 
investigations 15 of the removal of inert gases in discharge tubes at low 
pressures it was erroneously concluded that the gas penetrates into the 
cathode as ions, while the cathode metal is disintegrated and rede- 
posited by diffusion. Later work of the same nature showed quite 
conclusively 16 that the apparent loss of neon from lamp tubes was due 
not to the passage of the gas through iron either at room temperature or 
at a bright red heat, but to an apparent adsorption upon the iron under 
the influence of an electric discharge. As a very brief summary of the 
work by LUMPE and SEELIGER 16 one would say that they found that, in 
the discharge tube, the inert gas particles were "shot into" 
("hineingeschossen") the cathode and there "remained stuck" ("stecken bleiben") 
in some sort of mechanical trap. A curve showing the distribution of 
the particles indicated a greater concentration at the surface. On 
heating to a few hundred degrees Celsius the gas particles are released 
and "diffuse" ("diffundieren") out of the cathode. No claim was made 
that there was any true equilibrium solubility of the inert gas in the 
iron cathode, and it was not suggested that diffusion proceeded to 
cause a uniform distribution of the gas throughout the cathode. 

One other experiment, of doubtful value, 17 indicated that at 505°C 
and a gas pressure of 72 cm Hg the diffusion of helium through alumi- 
nium was as rapid as that of hydrogen. The measured rate was 
0.026 × 10⁻⁷ ml N.T.P. × mm thickness per sq cm heated area per 
second. Since this anomaly was not investigated further it must remain 
subject to doubt. 

Nitrogen 

In his recent review RHINES 9 points out that the compound rule pro- 
posed by FAST 6 seems to hold in a general way for oxygen, nitrogen 
and hydrogen, but the correlation between solubility and diffusion is 
better. He states further: 

"Nitrogen forms compounds with, is soluble in, and diffuses through, 
most of the metals of the last four groups of the periodic table of the 
elements. It is generally nonreactive with, insoluble in, and fails to 
diffuse significantly through, the first-group metals and only sparingly, 
when at all, through the metals of the second and third groups." 

At an earlier time SMITHELLS 1 made the following generalization: 

"Nitrogen is only soluble in those metals which are capable of forming 
nitrides. We have previously noted the same distinction both in 
activated adsorption and diffusion, and the close connexion between 
the three kinds of gas-metal equilibria is again emphasized. Nitrogen 
has been shown to be insoluble within the limits of the experimental 
method in cobalt, 18* copper, 19* silver, 20* and gold. 21* Those metals 
which, like iron and molybdenum, absorb nitrogen, show the same kind 
of changes in lattice structure, mechanical and electrical properties as 
are shown when the hydride-forming metals absorb hydrogen. The 
nitrogen-metal systems have not been investigated so completely as 
those of hydrogen, but the close parallel between them is evident from 
the data." 

Later SMITHELLS 7 stated that "nitrogen is soluble in a limited number 
of metals and although there is evidence of simple solubility in some 
cases, all metals which dissolve this gas are known to form definite 
nitrides under suitable conditions." 

An examination of the literature following this brief statement will 
indicate that SMITHELLS' generalization is probably closer to the truth 
than RHINES' statement. It is recognized that some of the errors in 
RHINES' statement may be due to the type of periodic table being used. 
Nitrogen is generally soluble in groups VA, VIA and VIIA but (except 
for iron) not VIII, if those are to be taken as the "last four groups." 
It is also soluble in group IVA. RHINES is quite correct about nitrogen's 
relation with groups IA and IB. There is some ambiguity in his state- 
ment regarding the second and third group elements, where nitrogen 
reacts with several elements to form nitrides but has not been reported 
to dissolve in the same elements. SMITHELLS' statement seems to be 
more generally acceptable. 

On the basis of the two statements above, it seems worthwhile to 
investigate the literature to determine which metals do form nitrides 
and to find whether these do dissolve nitrogen under other conditions. 
Table 1 shows the elements of interest and their positions in the 
periodic table. WELLS 22 reports that most elements combine with 
nitrogen, and the nitrides include some of the most stable compounds 
known. He reports that the ionic nitrides include those of lithium, 
beryllium, magnesium, the metals of IIA and IIB, germanium and 
thorium, but not those of the IVA metals of the type TiN. A nitride 
of mercury, Hg₃N₂, also occurs but may be different in structure from 
the other Group II nitrides. The nitrides of beryllium, magnesium 
and calcium in Group IIA are described as being anti-isomorphous with 
oxides M₂O₃, i.e. the metal atoms occupy the positions of the oxygen 
atoms and the nitrogen those of the metal atoms in the M₂O₃ structure. 
Thus the structure becomes a compound of the type M₃N₂. 

* This number refers to the list of references at the end of the present paper. 

The nitrides of Group III have great chemical stability and are
described by WELLS 22 as covalent nitrides. These compounds are formed
of equal numbers of atoms of elements of Groups III and V, and the
nitrides of aluminium, gallium and indium crystallize with the wurtzite
structure.

WELLS 22 reports numerous nitrides of the transition elements of varied 
structures. The characteristic feature of these nitrides is that the small 
non-metal atoms occupy some or all of the interstices of the metal 
lattice. The relative positions of the metal and nitrogen atoms are
controlled by the number and nature of the interstices occupied by the
nitrogen atoms, and they may be the same as in certain simple structures
such as the sodium chloride and calcium fluoride structures. The
following compounds have been reported to crystallize with the rock-salt
structure: ScN, CeN, LaN, TiN, ZrN, VN, NbN, PrN.

We have now determined which metals will form nitrides at higher 
temperatures; these same metals may dissolve nitrogen at lower
temperatures. Quantitative measurements are not important here but a 
review of the literature, indicated in Table 2, will show that there is a 
definite solubility of nitrogen in the metals listed. 

TABLE 2

Metal        References                   Metal      References

Iron         3, 10, 23, 24, 25, 26, 27    Hafnium    27
Zirconium    3, 10, 27, 28, 29            Thorium    27
Tantalum     3, 27                        Vanadium   27, 33
Molybdenum   3, 27, 30, 31, 32, 10, 25    Niobium    27
Tungsten     3, 10, 27, 32                Chromium   27, 33
Manganese    3, 27, 33                    Uranium    27
Titanium     27, 29, 33

HAGG 34 and SMITHELLS and ROOKSBY 35 have found that, although
tungsten reacts very slowly to form the nitride W2N at high tempera-
tures, no solid solutions of nitrogen in tungsten are formed.

It has been shown 36 that in the ranges investigated in the experi- 
mental work, there is no solubility of nitrogen in cadmium, thallium, 
zinc, lead, bismuth, tin, and antimony. More recently it was found 33 
that there is no solubility of nitrogen below 1200°C in nickel, cobalt,
copper, and tungsten. In the references above, which indicated that
tungsten would dissolve nitrogen, the measurements were made above
1200°C, the solubility being 0.304 cm3/100 g at 2400°C, so that it is
rather small if it exists at all. 

RONTGEN and BRAUN 37 found that no measurable amount of nitrogen 
is absorbed by aluminium between 700°C and 1000°C because a surface
film of aluminium nitride is formed which prevents diffusion of the gas
into the metal. At most there may be 1 c.c./100 g in the molten metal
at the melting point, but the solubility in the solid metal would be
much smaller than that. On the other hand, CHAUDRON 38 and
MOREAU 39, 40 found relatively large amounts of nitrogen (in one case
7.4 c.c./100 g in the solid metal) dissolved in aluminium which had
been electrolytically polished so that no adsorbed layer would affect
the results. Their method of degassing was to make the specimen the
cathode in a vacuum discharge tube and apply a potential to the
terminals of from 5 to 150 kV with a current of about 5 mA. Degassing
took place in the cold, the surface molecules (of aluminium nitride 
presumably) being ionized and drawn into the electric field, replace- 
ment being effected by diffusion in the metal. In hydrogen-metal 
systems the experimental results presented by these workers all appear 
to indicate a much higher solubility than one would expect. It is felt 
that their findings in the aluminium-nitrogen system may possibly 
result from the decomposition of the aluminium nitride compound and 
do not, in fact, represent the true solid solubility of nitrogen in 
aluminium. 

As was stated earlier, it seems that SMITHELLS' statement that nitro- 
gen is only soluble in those metals which are capable of forming 
nitrides, is the most applicable generality that can be made about the 
nitrogen-metal systems. 

Oxygen 

The measurement of the solubility of oxygen in metals is quite difficult, 
since, except in the case of the noble metals, a separate oxide phase 
is formed when the limit of the solid solubility is exceeded. Thus the 
true solubility of the oxygen may be small in comparison with the total 
amount of oxygen present in the metal. In many molten metals the 
large content of dissolved oxygen is sharply decreased on solidification 
when the gas may be evolved or may form an oxide of the metal.
Generally, the solubility of oxygen in metals is of the same order of
magnitude as that of hydrogen, but it is usually more difficult to
determine. Unless the metal being used in an investigation is very
pure, the usual method of establishing equilibrium by diffusion
inwards from an external oxide phase is liable to lead to erroneous
results. As will be shown later, the gas consumed in the preferential
internal oxidation of impurity elements, which form more stable oxides
than the metal being examined, may be misinterpreted as gas going into
solution.
Thus a large error may be introduced where the true solubility is, in 
fact, quite small. 

The oxygen-copper system seems to have been investigated more
thoroughly than any other. HANSON, MARRYAT and FORD 41 placed
the solubility limit at less than 0.009 per cent oxygen at 1000°C and
ALLEN and STREET 42 suggested it to be not over 0.005 per cent oxygen
at 500°C. The earliest data on spectroscopically pure copper were given
by RHINES and MATHEWSON, 43 who reported that the solid solubility
increases from 0.007 per cent by weight at 600°C to about 0.015 per cent
at 1050°C. A later paper by PHILLIPS and SKINNER 44 showed that
previous determinations had given results which were too high. They
found the solubility to range from 0.0022 per cent at 700°C to 0.0077
per cent at 1050°C. The probable reason for the apparent difference in
solubility is the experimental difficulty of determining such small
percentages of dissolved gas analytically.
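
As a rough illustration of the temperature dependence implied by the
PHILLIPS and SKINNER values, the sketch below passes an Arrhenius-type
expression S = S0 exp(-Q/RT) through the two quoted points. The
exponential form, the function names and the resulting apparent heat of
solution Q are assumptions introduced here for illustration; they are not
taken from the original paper.

```python
import math

R = 8.314  # gas constant, J/(mol K)


def apparent_heat_of_solution(t1_c, s1, t2_c, s2):
    """Apparent heat Q (J/mol) from two solubility points.

    Assumes S = S0 * exp(-Q / (R * T)); temperatures are in deg C.
    """
    t1 = t1_c + 273.15
    t2 = t2_c + 273.15
    return R * math.log(s2 / s1) / (1.0 / t1 - 1.0 / t2)


def solubility(t_c, t_ref_c, s_ref, q):
    """Solubility at t_c, scaled from a reference point with the same assumed law."""
    t = t_c + 273.15
    t_ref = t_ref_c + 273.15
    return s_ref * math.exp(-q / R * (1.0 / t - 1.0 / t_ref))


# Values quoted in the text: 0.0022 per cent at 700 C, 0.0077 per cent at 1050 C.
Q = apparent_heat_of_solution(700.0, 0.0022, 1050.0, 0.0077)
print(f"apparent heat of solution: {Q / 1000:.0f} kJ/mol")
print(f"interpolated value at 900 C: {solubility(900.0, 700.0, 0.0022, Q):.4f} per cent")
```

With only two points the fit is exact by construction, so the sketch
says nothing about whether the exponential form is actually obeyed.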

Usually the solubility of oxygen in metals is complicated by the 
formation of oxide phases, so that complex phase diagrams result. A 
diagram given by BARRER 3 illustrates the great variety of solid solutions
possible. Earlier HERTY and GAINES 45 showed that oxygen must
dissolve in liquid iron as FeO, and probably dissolves in the same
manner in the solid metal. The maximum solubility in alpha-iron is
probably less than 0.05 per cent and may be slightly higher in
gamma-iron.

In the cobalt-oxygen system, SEYBOLT and MATHEWSON 46 found from 
analysis and micrographic observation that the solid solubility of 
oxygen in cobalt increases from about 0.004 per cent by weight at
550°C to 0.020 per cent at 875°C, at which temperature the hexagonal
form of cobalt changes into the face-centred cubic form. In the gas-free
metal this transformation occurs at 850°C. In the cubic cobalt, the
oxygen content increases almost linearly at constant pressure from
0.004 per cent at 875°C to 0.022 per cent at 1450°C.

A similar solubility of oxygen in the form of nickel oxide (NiO) has 
been reported by MERICA and WALTENBERG. 47 

The solubility of oxygen in silver has been studied by STEACIE and
JOHNSON, 20 who showed that it is proportional to the square root of
the pressure, is not in any simple relation to the temperature, and is a
minimum at 400°C. They reported that, at a pressure of 80 cm of
mercury, the solubility varies from 0.142 c.c. of oxygen per c.c. of
silver at 200°C to 0.087 c.c. at 400°C, 0.193 c.c. at 700°C, and 0.354 c.c.
at 800°C. They suggest that below 400°C, the dissolved oxygen exists
in the form of Ag2O, and above this temperature as oxygen atoms, since
Ag2O is unstable at the higher temperature. SIMONS 48 argues that the
minimum is due to the rate of increase of the dissociation pressure of
Ag2O with temperature being greater than the rate of increase in
solubility of Ag2O in silver.
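
The square-root pressure dependence quoted above can be used to scale
the tabulated values to other pressures. The following minimal sketch
does this for the four temperatures given in the text; the function
names are introduced here for illustration, and the scaling is assumed
to hold over whatever pressure range is entered.

```python
import math

# Solubility of oxygen in silver at 80 cm of mercury, as quoted in the text
# (c.c. of oxygen per c.c. of silver, keyed by temperature in deg C).
SOLUBILITY_AT_80_CM = {200: 0.142, 400: 0.087, 700: 0.193, 800: 0.354}
REFERENCE_PRESSURE_CM = 80.0


def solubility_at_pressure(temp_c, pressure_cm):
    """Scale a tabulated value to another pressure, assuming S varies as sqrt(P)."""
    s_ref = SOLUBILITY_AT_80_CM[temp_c]
    return s_ref * math.sqrt(pressure_cm / REFERENCE_PRESSURE_CM)


def temperature_of_minimum():
    """Tabulated temperature at which the quoted solubility is smallest."""
    return min(SOLUBILITY_AT_80_CM, key=SOLUBILITY_AT_80_CM.get)


print(temperature_of_minimum())
print(solubility_at_pressure(400, 20.0))
```

Reducing the pressure to one quarter halves the solubility, and the
minimum among the quoted temperatures falls at 400°C, in agreement with
the statement in the text.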

SMITHELLS 27 indicates that oxygen is probably slightly soluble in 
palladium and rhodium, but there are no published data. Oxygen is 
insoluble in mercury, gold, osmium, iridium, and platinum. If there is 
any solid solubility of oxygen in tin and lead it would be very small. 
GROSHEIM-KRISKO, et al. 49 report the oxygen content of molten lead
at 350°C to be 48-60 × 10^-5 weight per cent. Solid solutions con-
taining up to 40 atomic per cent of oxygen without the separation of
an oxide phase are possible in zirconium 28, 50, 51 and titanium. 52
Oxygen is also soluble in vanadium, niobium and tantalum. 

Hydrogen 

In the metal-hydrogen systems there are four different types of inter- 
action between the gas and the metal, as follows : 

1. Hydrogen will react to form hydrides of a salt-like nature with
the alkali and alkaline-earth metals. These primary salt-like
hydrides are ionic, and generally crystalline, non-volatile com-
pounds. These compounds, with the negative hydrogen or
hydride ion H-, are analogous to the corresponding halides.
GIBB 53 and RUNDLE 54 have fairly recent papers dealing in some
detail with the formation of these hydrides, and their references
will lead to a broader view than need be given here.

2. With the metals of Group IVB, VB and VIB covalent hydrides
are formed yielding compounds, gaseous at room temperature,
analogous to the hydrocarbons and typified by the familiar
arsine. Some of the covalent hydrides formed are AsH3 and
SiH4.

3. A number of metals, such as chromium, iron, cobalt, nickel,
platinum, copper, silver, molybdenum, magnesium, and
aluminium, dissolve hydrogen to form true solutions as indicated
by the fact that (a) the hydrogen solubility varies as √P, and
(b) solubility increases with increasing temperature.

4. With the rare-earth metals and titanium, zirconium, tho-
rium, hafnium, vanadium, niobium, tantalum, uranium, and
manganese, hydrogen forms what DUSHMAN 16 calls "pseudo-
hydrides." SMITH 2 describes these metals as being "exothermic
occluders," while those in section (3) above are "endothermic
occluders." Although the solubility varies as √P for certain
ranges of values of pressure (P), it decreases as the
temperature increases. Palladium behaves similarly with respect to
hydrogen.
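
The distinction between the "endothermic" and "exothermic" occluders of
sections (3) and (4) can be illustrated with a simple sketch in which the
solubility is written as S = S0 √P exp(-ΔH/RT). This expression, the
symbol ΔH for the heat of solution, and all numerical values below are
assumptions made here for illustration and are not data from the works
cited.

```python
import math

R = 8.314  # gas constant, J/(mol K)


def sieverts_solubility(pressure, temp_k, s0, delta_h):
    """Solubility assumed to follow S = s0 * sqrt(P) * exp(-delta_h / (R * T))."""
    return s0 * math.sqrt(pressure) * math.exp(-delta_h / (R * temp_k))


def classify(delta_h):
    """Positive heat of solution: solubility rises with temperature."""
    return "endothermic occluder" if delta_h > 0 else "exothermic occluder"


# Hypothetical heats of solution (J/mol), chosen only to show the two trends.
for delta_h in (30000.0, -30000.0):
    low = sieverts_solubility(1.0, 600.0, 1.0, delta_h)
    high = sieverts_solubility(1.0, 900.0, 1.0, delta_h)
    trend = "increases" if high > low else "decreases"
    print(f"{classify(delta_h)}: solubility {trend} from 600 K to 900 K")
```

In both cases the √P dependence is retained; only the sign of the assumed
heat of solution changes the direction of the temperature effect.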

As far as the writer has been able to determine, hydrogen is not 
absorbed by tungsten (may have small solubility), gold, zinc (?), 
cadmium (?), indium, thallium, or mercury. Uranium has been re-
ported to have a small solubility of hydrogen in one reference 27 and to be
an "exothermic occluder" in another reference. 2

Solid tin and lead have a very small capacity to absorb hydrogen. 

GAS-METAL EQUILIBRIUM, DIFFUSION AND SOLUTION 

It has been stated in the introduction to this paper that the effects of 
gases on the physical and mechanical properties of metals will be the 
main subject of study. However, the chemical considerations of the 
overall problem must not be entirely overlooked. In the preceding 
section, the various possible gas-metal combinations were reviewed in
an attempt to show where one might expect to find unusual physical 
effects. In comparatively recent years there has been an increasing 
awareness of the profound effects which gases may have on solid metals. 
Much more study is needed, however, before a reasonable theory can be 
developed to cover the entire gas-metal field. 

The formidable volume of work which has been devoted to the study 
of gas-metal equilibria is indeed impressive. It is felt that it is better 
not to consider this field at all, rather than to create a false impression 
by a too brief review ; therefore, references will be made to a few fairly 
recent reviews devoted largely to this phase of the work. The reader is 
warned, not unjustly it is felt, to be extremely critical of solubility 
measurements and diffusion rates particularly for oxygen. As an 
example of this, EVANS 55 has listed the reported solubilities of oxygen
in iron as found by fourteen different groups. The results vary from
6 to 0.006 per cent oxygen (at 1000°C), a marked variation. EVANS
considers the serious discrepancy to be due to: (a) effects of traces of
impurities which are often not reported in the literature, (b) the use of 
powder samples which adsorb large amounts of gas, which is reported 
as dissolved gas, (c) contamination of the iron by refractories or volatile 
compounds, (d) insufficient time allowed for equilibrium conditions to 
be reached, and (e) variation in the methods used, and consequent 
variation in the inherent errors. Similar types of errors are probably 
the cause of variations in the reported solubilities in other systems. 
Most papers on gases and metals refer to SMITHELLS' monograph 1
(and occasionally to a paper 7) which is now, justifiably, a classic in this
field. In these treatments, SMITHELLS deals with the three processes
involved between gases and metals: adsorption, diffusion and solution,
and he shows how the laws governing these phenomena have been 
evolved. 

In the few years after the publication of SMITHELLS' book, the
subject of gases and metals advanced rapidly, so that by 1941 there was 
a need for a comprehensive review of the subject. BARRER 3 fulfilled at 
least part of the need by devoting a long section of his valuable mono- 
graph to the subject of diffusion of gases through solid metals. In this 
survey, BARRER dealt with the work which had been done up to that 
time, and developed a reasonable explanation for the phenomenon of 
gaseous diffusion through solid metals. The present author feels that 
this monograph is still to be highly recommended, although it is in need 
of revision. 

Shortly before BARRER'S book was published, SIMONS and HAM 56 
recorded their belief that the solution and diffusion of gases through 
metals could be considered as purely chemical phenomena. It was 
claimed that the specific nature of the solubility of gases in metals 
indicates that the gases are present mainly as compounds with the 
metal system. Since the rate of diffusion of a gas through a metal is 
inversely proportional to the thickness of the metal, it was felt that the 
adsorbed film of gas, usually postulated in diffusion processes, could not 
be related to the empirical results. For this and other reasons, diffusion 
was, therefore, treated as a case of chemical equilibrium between the 
metal and the gas, and on this basis the usual diffusion laws were derived 
without reference to the mechanism of the reaction at the gas-metal 
interface. 
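
The inverse proportionality between the rate of diffusion and the wall
thickness, on which the argument above rests, may be sketched as
follows. The function names and the sample measurements are hypothetical
and serve only to show how such a relation could be checked against
data.

```python
def scale_rate_to_thickness(rate_ref, thickness_ref, thickness):
    """Scale a measured rate to another wall thickness, assuming rate ~ 1/thickness."""
    if thickness_ref <= 0 or thickness <= 0:
        raise ValueError("thicknesses must be positive")
    return rate_ref * thickness_ref / thickness


def obeys_inverse_thickness(samples, rel_tol=0.05):
    """True if rate * thickness is constant within rel_tol over all samples.

    samples is a list of (thickness, rate) pairs.
    """
    products = [d * r for d, r in samples]
    mean = sum(products) / len(products)
    return all(abs(p - mean) <= rel_tol * abs(mean) for p in products)


# Hypothetical measurements: thickness in mm, rate in arbitrary units.
samples = [(0.5, 8.0), (1.0, 4.1), (2.0, 1.95)]
print(obeys_inverse_thickness(samples))
print(scale_rate_to_thickness(8.0, 0.5, 2.0))
```

A rate governed by a surface process alone would not be expected to pass
such a check, since it would be independent of the wall thickness.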

STEARN and EYRING 57 briefly considered the diffusion of gases in
metals from the point of view of a theory of the absolute rates of 
reactions in which the slow process of diffusion entails the surmounting 
of an energy barrier. In this paper some doubt was thrown on the 
possibility of "rift" diffusion which had been suggested as a mechanism 
by SMITH and his co-workers. 2 

FAST 58 has discussed the mechanism of diffusion of gases in metals, a 
subject about which, he says, ". . . very little is yet known with 
certainty." In general the gas, in atomic (or ionic) form, penetrates 
between the metal ions of the lattice. Unlike SIMONS and HAM, 56 
FAST thinks that the surface effect and not the diffusion itself is 
frequently the deciding factor in the diffusion velocity. In a later 
paper FAST 59 considered further the permeability of gases through metal 
walls. As a result of the diatomic character of the gas phase, there are 
five stages involved in the penetration of gases through metal walls:
(1) dissociation of molecules into atoms or ions at the surface of entry,
(2) penetration of the atoms or ions into the metal, (3) diffusion in the
metal, (4) a transition from the dissolved to the adsorbed state at the
exit surface, and (5) reassociation of the components into diatomic
molecules. The speed of the whole permeation process is determined by 
the slowest of these five successive steps. Examples were given of 
experimental cases where each of these five stages is the controlling 
factor and these were discussed in some detail. The influence of the 
surface condition on permeability was explained, and data were given 
to show the effect of temperature on permeability. 
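
The statement that the slowest of the five successive stages controls the
permeation may be expressed as a short sketch. The stage names follow
the list above; the rates supplied are hypothetical numbers in arbitrary
units and do not correspond to any system discussed by FAST.

```python
STAGES = (
    "dissociation at the entry surface",
    "penetration into the metal",
    "diffusion in the metal",
    "transition to the adsorbed state at the exit surface",
    "reassociation into diatomic molecules",
)


def controlling_stage(rates):
    """Return (stage name, rate) for the slowest of the five successive stages."""
    if len(rates) != len(STAGES):
        raise ValueError("one rate is required for each of the five stages")
    index = min(range(len(STAGES)), key=lambda i: rates[i])
    return STAGES[index], rates[index]


# Hypothetical rates (arbitrary units): here bulk diffusion is the slowest.
print(controlling_stage([12.0, 9.5, 0.8, 7.0, 15.0]))
# Hypothetical contaminated surface: dissociation at entry becomes controlling.
print(controlling_stage([0.1, 9.5, 0.8, 7.0, 15.0]))
```

The second call shows how a change in surface condition alone can shift
the controlling stage away from diffusion in the metal.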

RHINES 60 has written a general review of a number of industrial
processes which are influenced by gas-metal diffusion, which may have
either beneficial or detrimental effects. The information for that paper
was gathered by consulting people who were closely associated with the
various practices in which gas-metal diffusion plays a part. Some of the
phenomena discussed were: hydrogen embrittlement, subsurface
oxidation, and rejection of dissolved gases on solidification of molten 
metal. 

In 1948, BARRER 61 reviewed the factors determining the formation
of interstitial compounds and some of their properties. While the
formation of compounds is not really within the scope of the present
paper, this discussion by BARRER is extremely interesting and valuable.
It is noteworthy that BARRER, unlike many authors who have written
about gas-metal systems, regards the interstitial hydrogen dissolved in
a metal to be substantially un-ionized. The paper deals with the
exothermal formation of solutions of hydrogen in metals, equilibrium 
between interstitial and free gas, and rate-controlling processes. 

In the same year, three papers appeared which apply to this discus- 
sion. LEPP 62 studied the absorption of gases in metal by applying 
physico-chemical laws and thermodynamic analysis. SACHSE 63 gave
detailed abstracts of studies (conducted in Germany during the late war)
of the diffusion of hydrogen and deuterium in palladium, and of hydro-
gen in iron. SCHAFER 64 reviewed the work of SIEVERTS done during the
war years on the solubility of hydrogen and deuterium in nickel, 
niobium, and palladium, and on the effect of hydrogen on the transition 
points in some iron alloys. 

A monograph recently published by SMITH 2 deals entirely with the 
problem of hydrogen in metals. This monograph covers the subject 
with remarkable completeness, and the bibliography of some 1470 
references is very useful. However, very early in his book, SMITH 
questions the possibility of the existence of solid solutions in "endo-
thermic metal-hydrogen systems," and throughout the remainder of
the book the discussion is dominated by his "rift" hypothesis. It is
claimed that the various experimental phenomena can best be explained
by the opening or closing of these "rifts" which are apparently lattice
defects, or widened lattice intervals. One is impressed by the fact that 
the rift hypothesis is entirely qualitative. The concept of rifts might 
also be questioned if the specificity of diffusion rates and solubility data 
are considered. The present writer must agree with RANSLEY, 65 who 
said of SMITH'S monograph: "Indeed, it is not unfair to say that in a
good many instances the author has strained the facts very severely in 
an attempt to make them conform to this theory, when a simple explana- 
tion based on lattice solution and diffusion would appear to be perfectly 
adequate." There are, of course, cases where the rift hypothesis may 
very well apply, and where it is, in fact, the preferred explanation. 
However, the explanation of such a formidable mass of empirical 
results by one mechanism does not yet seem possible. Since the 
publication of this book, SMITH has written three papers incorporating 
the rift hypothesis in each. 66, 67, 68

As part of a book on vacuum techniques, DUSHMAN 16 included a
section (Chapter 9) on gases and metals. Although much of the experi- 
mental data in this section was published before 1940, the review is 
valuable because it does bring much of the available information under 
one cover. DUSHMAN, among other things, considered the diffusion 
and solubility of hydrogen, oxygen and nitrogen. 

In 1950, BASTIEN 69 discussed the various stages in the permeation of
a gas through a metal wall. The consideration, which deals particularly 
with the iron-hydrogen system, includes adsorption and diffusion 
phenomena, the effects of temperature and pressure, and effects of 
gas-metal equilibria. 

Recent work by BHAT and LLOYD has shown that the effect of 
cold-work is to decrease the rate of hydrogen diffusion in steel at room 
temperature. It was pointed out that diffusion results obtained by 
electrolytic experiments are only qualitative; this might account for 
many of the disagreements between solubility and diffusion measure- 
ments in other hydrogen-metal systems. For reasons which will not be 
discussed here, the authors believed that, although variation in the 
lattice spacing could affect diffusion rates, probably the occluded 
molecular hydrogen stored at crystallite boundaries is the major 
factor affecting the diffusion process. The correlation of cold-work and
rate of hydrogen diffusion with changes in hardness and density is
also discussed. This work is related to the studies conducted by
ANDREW on hair-line cracks and embrittlement in steel (q.v.). 

ISENBERG 71 has written a brief note to demonstrate why hydrogen
ionizes when dissolved in a metal. On a semi-classical model the 
conclusion was reached that hydrogen should ionize in all metals. 

A series of studies by GULBRANSEN and ANDREW 72-75 on the reactions
of zirconium, titanium, niobium and tantalum with oxygen, nitrogen
and hydrogen provides an excellent review of the work which has been
done on these metals. The data found by GULBRANSEN and ANDREW
on the kinetics of the gas-metal reactions add considerably to the 
knowledge of these metals. At present it is difficult to follow the 
latest work on titanium and zirconium, for so much experimental work 
is in progress. The series noted might provide a useful starting point 
for a reader particularly interested in this work. 

A recent review by RHINES, 9 although very short, summarizes some
of the experimental facts of gas-metal diffusion processes. This seems 
to be the most recent review of this field. 

Although not directly applicable here, it is interesting that equili-
brium relations in certain (titanium) alloy systems have been investi-
gated by measuring hydrogen pressures in equilibrium with very small
amounts of hydrogen in solution in the alloys. McQUILLAN 76-79 has
studied the alpha → beta transformation in various titanium samples,
and the effect on the transformation of alloying elements.

EFFECTS OF DISSOLVED GASES ON THE PHYSICAL AND 
MECHANICAL PROPERTIES OF METALS 

In addition to such effects as hydrogen embrittlement of various metals, 
and porosity caused by gases coming out of solution, there are several 
phenomena of interest which are directly attributable to dissolved gases. 

Internal Friction*

* This section on Internal Friction is presented only as a brief description of this
phenomenon which is one manifestation of the effect of gases on metals. A comprehen-
sive review of Internal Friction in Metals by A. S. NOWICK is presented elsewhere in
this volume.

One of the most interesting of these phenomena is the effect which
absorbed gases have on the relaxation behaviour of metals. A com-
prehensive theoretical treatment of relaxation phenomena, to which
the reader is referred, has been given by ZENER. 80 One type of relaxa-
tion effect is internal friction. It has been found that, under suitable
conditions, there will be a maximum absorption of energy by a metal
when an applied vibrational stress has a period equal to that of some
mechanism taking place within the metal itself. Such
mechanisms as reversible grain boundary movement, stress-induced
diffusion, and thermal diffusion have been postulated as causes for the
marked dissipation of energy which has been observed in some systems.
The relationship between magnetic and mechanical relaxation pheno-
mena was first mentioned by SNOEK. 81 It had been pointed out by
YENSEN 82 that small quantities of both carbon and nitrogen have a
marked influence on both the magnetic and plastic properties of iron.
It was thought that the slow decrease with time of the magnetic
permeability in iron at room temperature was due to the segregation of
nitrogen. Very little exact knowledge was available, however, until 
SNOEK 83 found that such phenomena as magnetic after-effect and time 
decrease of permeability depend on the presence of carbon or nitrogen 
in solid solution. If these solute atoms are precipitated from the 
lattice the phenomenon of magnetic after-effect is no longer present. 
RICHTER 84 then found that samples of iron which had displayed a
magnetic after-effect also showed an elastic after-effect. Both effects
were temperature-dependent, and the absolute values of the relaxation
times were nearly equal. Finally, SNOEK 85 was able to show that both
carbon and nitrogen are capable of producing an elastic after-effect
(relaxation peak) in iron, thus correlating the magnetic and mechanical
phenomena. SNOEK mentioned that there was a true nitrogen solubility
at room temperature and that this could be measured by means of the
elastic after-effect.

By 1941 SNOEK was able to account for these effects on the basis of
the interstitial positions in a body-centred cubic lattice which have
tetragonal symmetry, the tetragonal axis being parallel to any one of
the three principal axes. In an unstrained material, the equilibrium
distribution of the solute atoms among the three types of interstitial
positions (given by the crystallographic indices (½00), (0½0), and
(00½)) is quite random. However, if the material is elastically stressed
in the x direction, there will be a preferred distribution of solute
atoms in the x-type positions (½00).
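
The preferred distribution under stress can be pictured with a simple
Boltzmann sketch in which an applied stress shifts the energy of one
type of interstitial position relative to the other two. The use of
Boltzmann statistics, the function name and the size of the energy shift
are assumptions introduced here for illustration; SNOEK'S own treatment
is not reproduced.

```python
import math

K_B = 8.617e-5  # Boltzmann constant, eV/K


def site_fractions(energy_shifts_ev, temp_k):
    """Equilibrium fractions of solute atoms on the x-, y- and z-type positions.

    energy_shifts_ev lists the assumed stress-induced energy change (eV) of each type.
    """
    weights = [math.exp(-e / (K_B * temp_k)) for e in energy_shifts_ev]
    total = sum(weights)
    return [w / total for w in weights]


# Unstrained material: the three types are equivalent.
print(site_fractions([0.0, 0.0, 0.0], 300.0))
# Hypothetical tensile stress along x, lowering the x-type energy by 0.01 eV.
print(site_fractions([-0.01, 0.0, 0.0], 300.0))
```

In the unstrained case each type holds one third of the solute atoms;
with the assumed shift the x-type positions are favoured, and the
y- and z-type positions remain equally populated.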

The above explanations apply to the case of annealed material. The 
results of SNOEK'S work with cold-worked iron were more complex and 
his explanation only tentative, and admittedly unsatisfactory. 

This relaxation peak discovered by SNOEK appears to be characteristic 
of all body-centred cubic metals. More recently, KE has studied three 
peaks in tantalum, corresponding to carbon, nitrogen and oxygen in 
interstitial solid solution. 

KE 86 has elaborated on the theory developed by SNOEK, and has 
done experiments, using tantalum, which have extended the empirical 
knowledge of the phenomenon considerably. His observations are the 
first experimental evidence that oxygen forms an interstitial solid 
solution with a metal. KE has emphasized the important observation 
that the internal friction peak for oxygen cannot be interpreted in
terms of a single relaxation time with a unique activation energy. 
The significance of this will be discussed later. 

KE has proposed a simple picture of this relaxation effect which
may be described with reference to Fig. 1, which represents a body-
centred cubic lattice. If solute atoms occupy the octahedral inter-
stitial positions in the lattice, then the lattice will be subjected to
tetragonal deformation with the tetragonal axis along one of the <100>
directions, depending on which position the solute atom has occupied.
If a tensile stress is applied along one of the <100> directions, then the
solute atom will prefer to occupy a position such that the tetragonal axis
mentioned above will be along the same direction. Thus, under fluctuat-
ing stress, solute atoms will "diffuse" in an attempt to maintain this
state. Under suitable conditions of temperature and applied frequency
this "diffusion" process will absorb large amounts of the applied
vibrational energy.

Fig. 2. Internal friction curves for original tantalum specimen and for
the oxygen-enriched specimen. Both were determined with two
frequencies of vibration (KE 88).
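
The dependence of the absorption on temperature and applied frequency
can be sketched with the standard single-relaxation-time expression,
in which the internal friction is Δ ωτ/(1 + ω²τ²) and the relaxation time
is assumed to follow τ = τ0 exp(H/RT). This form is not written out in
the text above, and the values of Δ, τ0 and H used below are hypothetical.

```python
import math

R = 8.314  # gas constant, J/(mol K)


def relaxation_time(temp_k, tau0, activation_j_mol):
    """Assumed Arrhenius relaxation time tau = tau0 * exp(H / (R * T))."""
    return tau0 * math.exp(activation_j_mol / (R * temp_k))


def internal_friction(temp_k, freq_hz, strength, tau0, activation_j_mol):
    """Single-relaxation-time internal friction at a given temperature and frequency."""
    wt = 2.0 * math.pi * freq_hz * relaxation_time(temp_k, tau0, activation_j_mol)
    return strength * wt / (1.0 + wt * wt)


def peak_temperature(freq_hz, tau0, activation_j_mol):
    """Temperature at which omega * tau = 1 (requires omega * tau0 < 1)."""
    omega = 2.0 * math.pi * freq_hz
    return activation_j_mol / (R * math.log(1.0 / (omega * tau0)))


# Hypothetical parameters: tau0 = 1e-14 s, H = 100 kJ/mol, strength = 0.02.
for freq in (1.0, 2.0):
    t_peak = peak_temperature(freq, 1e-14, 1.0e5)
    print(f"{freq} c.p.s.: peak near {t_peak:.0f} K")
```

On this model the peak height is half the relaxation strength, and the
peak moves to a higher temperature when the frequency of vibration is
raised, which is why curves determined at two frequencies are displaced
from one another.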

Since all interstitial positions are equivalent in the <111> direction,
a tensile stress applied in that direction should produce no relaxa-
tion effect. DIJKSTRA 87 has confirmed this by showing that the relaxa-
tion strength along a <111> direction in single crystals of alpha-iron
is only about 5 per cent of that along a <100> direction.
In his experimental work, KE 86 has determined that the oxygen
content in solution in tantalum is greater than 0.06 weight per cent
at 170°C.

In Fig. 2 are shown the internal friction peaks determined at two 
frequencies of vibration caused by carbon (0.013 per cent) in tantalum.
The higher peaks in the same figure are caused by the composite effect 
of carbon and oxygen which diffused into the metal on annealing in an 
oxygen atmosphere. The contribution due to oxygen alone was 
determined by subtracting the carbon curve value of internal friction 
from the composite value at the same temperature. The internal 
friction peak due to oxygen alone was found to occur at about 20°C
higher than that due to carbon. 
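
The subtraction described above can be sketched as follows. The curve
values are hypothetical numbers chosen only to reproduce the general
behaviour described (an oxygen peak lying about 20°C above the carbon
peak); they are not read from Fig. 2.

```python
def oxygen_contribution(carbon_curve, composite_curve):
    """Subtract the carbon curve from the composite curve at common temperatures.

    Both curves map temperature (deg C) to internal friction.
    """
    common = sorted(set(carbon_curve) & set(composite_curve))
    return {t: composite_curve[t] - carbon_curve[t] for t in common}


def peak_temperature(curve):
    """Temperature at which the tabulated internal friction is greatest."""
    return max(curve, key=curve.get)


# Hypothetical internal friction values at four temperatures (deg C).
carbon = {130: 0.004, 150: 0.010, 170: 0.006, 190: 0.003}
composite = {130: 0.006, 150: 0.016, 170: 0.018, 190: 0.008}

oxygen = oxygen_contribution(carbon, composite)
print(peak_temperature(carbon), peak_temperature(oxygen))
```

The subtraction is meaningful only where both curves were determined at
the same temperature and frequency, which is why the sketch restricts
itself to the common temperatures.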

In Fig. 1 it is shown that there may be two kinds of interstitial 
positions for the solute atoms, and these are shown with their crystallo- 
graphic indices. Each kind of position shown may take up one of three 
types of position, each having the tetragonal axis along one of the 
principal axes, as mentioned before. Despite the different kinds of 
position which the solute atom may occupy, the mechanism of preferen- 
tial distribution will be the same in each case. KE has emphasized that 
the distortion produced when the solute atom is in the octahedral 
position is much higher, and the internal friction peak, therefore, 
greater. 

Since carbon atoms occupy the octahedral positions in martensite, 88 
it is assumed by KE that it is easier for the carbon atom to displace the 
two nearest neighbours of the octahedral position than it is to move the 
four atoms equidistant from the tetrahedral position. Regardless of the 
fact that the octahedral position is smaller than the tetrahedral with 
respect to nearest neighbours, this view is held to be correct. Since 
tantalum has an atomic radius greater than iron, the available inter- 
stitial space in the tantalum lattice will be larger, and the carbon atoms 
can enter the octahedral position. KE was able to demonstrate that 
the carbon peak at 150°C has a single relaxation time and a unique heat
of activation. This indicates that the carbon is distributed in only one
kind of interstitial position. If this is the tetrahedral position (for the
150°C peak) then there should be another, larger, peak associated with
the octahedral position. This was not found, so it is assumed that the 
octahedral position was the preferred location for the carbon atom. 

The characteristic internal friction peak for the metal containing 
oxygen in solution showed that there is no single relaxation time, and 
no unique activation energy. It is assumed from this that the oxygen, 
with an atomic radius smaller than carbon, may go into either inter-
stitial position. KE 86 has indicated how the diffusion mechanism from
one position to another will determine whether or not a single relaxation 
time is associated with the process. 
KE 89, 90 also measured the relaxation strengths of tantalum specimens
containing small amounts of carbon and oxygen. Experiments at 
temperatures ranging from 350 to 700K indicated that a linear rela- 
tionship exists between the reciprocal of the relaxation strength and the 
absolute temperature. Since this plot is a straight line passing through 
the origin, it is concluded that the interaction between solute atoms is 
negligibly small, and that the critical temperature for self-induced 
preferential distribution of solute atoms is close to absolute zero. This 
linear relationship is held to be in agreement with the previously 
postulated theory of stress-induced preferential distribution of solute 
atoms. No spontaneous ordering can occur in the solutions of carbon 
and oxygen in tantalum. 
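
The argument above can be illustrated with a short numerical sketch. It assumes a Curie-Weiss type law for the relaxation strength, strength = C / (T - Tc), which is not written out in the text; under that assumption the reciprocal strength plotted against absolute temperature is a straight line whose intercept on the temperature axis is the critical temperature Tc. The function name and the data are hypothetical and purely illustrative; they are not KE's measurements.

```python
def fit_reciprocal_strength(temps_k, strengths):
    """Least-squares line through (T, 1/strength); returns (C, Tc).

    Assumes strength = C / (T - Tc), so that 1/strength = (T - Tc) / C.
    """
    xs = [float(t) for t in temps_k]
    ys = [1.0 / float(s) for s in strengths]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # equals 1/C
    intercept = mean_y - slope * mean_x    # equals -Tc/C
    return 1.0 / slope, -intercept / slope


# Synthetic data generated with C = 2 and Tc = 0 (illustrative only).
temps = [350.0, 400.0, 450.0, 500.0, 550.0, 600.0, 650.0, 700.0]
strengths = [2.0 / t for t in temps]
curie_const, t_crit = fit_reciprocal_strength(temps, strengths)
print(f"C = {curie_const:.3f}, Tc = {t_crit:.3f} K")
```

A fitted Tc close to zero corresponds to the straight line passing through the origin, which is the condition the text takes to show that the interaction between solute atoms is negligible.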

It had been reported in an earlier paper 86 that an internal friction 
peak was found to occur in tantalum (with a vibration frequency of 
about 1 c.p.s.) at 350°C, as well as those at 150°C and 170°C, the latter
two being associated with carbon and oxygen respectively. KE 91 later
showed that the 350°C peak is associated with the presence of nitrogen
in tantalum. It is seen that the temperature of the peak is abnormally
high, and experiments were undertaken to decide whether the pheno-
menon was due to stress-induced diffusion as for carbon and oxygen, or
to viscous behaviour of the boundaries between tantalum and tantalum
nitride. It was found that tantalum, annealed in nitrogen at 1200°C,
and quenched from 900°C, had an initially high internal friction at
350°C but this decreased to a much lower value after 20 hours. Similarly
annealed material quenched from 600°C did not have as high an initial
value of internal friction and this also decreased to the same equilibrium
value after 20 hours' annealing at 350°C. During this time, tantalum
nitride needles were precipitated from the supersaturated solid solution 
of nitrogen in tantalum. From this it was concluded that the internal 
friction observed could not be due to nitrogen in precipitated form, since 
then the internal friction should have increased while precipitation 
proceeded. 

This precipitation of nitrogen from the solid solution with tantalum, 
as indicated by the internal friction measurements, is similar to that 
observed by DIJKSTRA 87 in the cases of carbon and nitrogen precipi-
tating from the solid solutions with alpha-iron.

KE found the heat of activation associated with the 350°C absorption
peak to be about 44,000 cal/mole, which is much larger than the 
values for oxygen in tantalum (29,000 cal/mole) or for nitrogen in 
alpha-iron, 92 which is 20,000 cal/mole. The observed nitrogen-tantalum 
peak could not be interpreted in terms of a single relaxation time with a 
unique heat of activation. Thus the same mechanism of solution is 
held to be true for nitrogen as was postulated for oxygen in tantalum, 
i.e. the nitrogen may be in either the octahedral or tetrahedral positions. 
The extremely high heat of activation, and with it the relatively high
temperature at which the peak occurs, was not explained. It was 
suggested that these phenomena may be due to a larger distortion in 
the lattice. 
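
The link between a high heat of activation and a high peak temperature can be sketched numerically. The sketch assumes the usual condition for a relaxation peak, 2*pi*f*tau = 1, together with an Arrhenius relaxation time tau = tau0 * exp(H/RT); neither relation is written out in the text, and the value of tau0 used below is a hypothetical placeholder, not a measured quantity.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K)


def peak_temperature(freq_hz, heat_cal, tau0_s):
    """Absolute temperature at which 2*pi*f*tau0*exp(H/RT) = 1.

    Only meaningful when 2*pi*f*tau0 < 1.
    """
    return heat_cal / (R_CAL * math.log(1.0 / (2.0 * math.pi * freq_hz * tau0_s)))


def heat_from_peak_shift(f1_hz, t1_k, f2_hz, t2_k):
    """Heat of activation (cal/mole) from the peak temperatures at two frequencies."""
    return R_CAL * math.log(f2_hz / f1_hz) / (1.0 / t1_k - 1.0 / t2_k)


TAU0 = 1.0e-14  # seconds; hypothetical value chosen only for illustration
t_low = peak_temperature(1.0, 29000.0, TAU0)
t_high = peak_temperature(1.0, 44000.0, TAU0)
print(f"H = 29,000 cal/mole: peak at {t_low:.0f} K")
print(f"H = 44,000 cal/mole: peak at {t_high:.0f} K")
```

For a fixed tau0 and frequency the peak temperature is directly proportional to H, so a larger heat of activation necessarily places the peak at a higher temperature. The second function shows how a heat of activation could be recovered from the shift of a peak between two vibration frequencies.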

KE 92 has also done work on iron and found three internal friction
peaks to occur at 20, 225 and 490°C. The 20°C peak is believed to be
of the same origin as that observed by SNOEK. 81 The mechanism causing
this peak, according to KE, SNOEK and also DIJKSTRA, 87 is the anelastic
effect caused by nitrogen in solution which has been described for the
case of carbon in tantalum. The 225°C peak which had been observed
previously by SNOEK 81 and WEST 93 was found by KE to be caused by the
stress-induced diffusion of nitrogen atoms, as in other cases, but
"within some peculiar type of stress regions created in the specimen
after it was subjected to cold work." No account of this mechanism has
been given but the heat of activation for this diffusion is known to be
32,000 cal/mole and the diffusion distance is of the order of a few
atomic diameters. The 490°C peak was found by KE to be due to grain
boundary relaxation in alpha-iron.

In a recent review on stress relaxation across interfaces, NOWICK 94 has
indicated that small amounts of impurities, microscopically undetectable,
cause blocking of the interface relaxation when lodged in the
grain boundaries. These impurities may cause partial or complete
disappearance of the grain boundary internal friction peak. This type
of peak is not due to the same causes as the one which has been discussed
in this section thus far. It is suggested that the reader see NOWICK'S
paper 94 and the references contained therein for a complete picture of
this phenomenon. Oxygen, probably in the form of oxides, is quite
effective in decreasing the grain boundary peak in copper and silver.
Fig. 3 shows how heating polycrystalline silver in air can reduce the
grain boundary peak almost to zero. A similar effect has been observed
for copper. The picture of this effect which NOWICK 94 has reiterated is
that the foreign particles "create new corners at which slip cannot take
place," so that the total relaxation may be smaller than in the pure
substance. According to his hypothesis, ". . . it is to be expected that,
as impurities are added systematically, the grain boundary peak
should not only decrease in magnitude but also move toward lower
temperatures because the effective size of the interfaces is shortened
and, therefore, becomes smaller."

Fig. 3. Effect of heating in air on the grain-boundary relaxation in
silver. (The steadily rising internal friction which remains after the
peak at 400°C has disappeared, results from another cause) (NOWICK 94).

Some of the more recent work on internal friction phenomena due 
to gases in solution has used the internal friction measurements as a 
quantitative indication of the amount of carbon or nitrogen in solution 
in alpha-iron. That the magnitude of the internal friction is strictly 
proportional to the concentration of either the carbon or the nitrogen 
in solid solution, for small solute concentrations, was predicted and 
verified by DIJKSTRA 87 for the case of alpha-iron. In a later paper
DIJKSTRA 95 used this relationship to follow the precipitation of either
nitrogen or carbon from solid solution, and also to measure their
solubility in alpha-iron as a function of temperature. It is pointed out
that the internal friction at the maximum of the internal friction peak
($Q^{-1}_{\max}$) has been related to resistivity which has, in turn, been
related to the solute element concentration. This has shown that
$Q^{-1}_{\max}$ is nearly equal numerically to the weight per cent
concentration of either carbon or nitrogen in solid solution. The present
writer feels that it is not necessary to review DIJKSTRA'S work, which is
very interesting but slightly beyond the scope of this review. As well as
the facts above, DIJKSTRA stated also that hydrogen had no effect on the
internal friction of alpha-iron in the temperature region between 40 and
100°C.

Despite the fact that FAST 96 had reported that the nitrogen peak in
iron becomes broader and appears at a higher temperature (nearer the
carbon peak) due to the presence of manganese in iron, JOSEFSSON and
KULA 97 have applied the internal friction peak effect to the study of
ageing and related phenomena in commercial mild steel. This work has 
confirmed the fact that the presence of impurities, such as manganese 
(and perhaps phosphorus), does change the position of the nitrogen peak 
and broadens the internal friction curve. Also it was confirmed that the 
carbon peak was not changed in position, although it may have been 
broadened by the addition of manganese to the iron. Ageing curves 
(as a function of internal friction) have been given for four steels which 
were each treated in several ways. The removal of nitrogen from solution 
by means of an aluminium addition, and the tendency for nitrogen to 
remain in supersaturated solid solution, even at slow cooling rates, in 
certain steels was also shown. 

Finally, the most recent work which has been published is a brief
note by MARINGER and MUEHLENKAMP, 98 who have found an internal
friction peak at 1300°F, and 1 c.p.s., in molybdenum, caused by the
stress-induced diffusion of nitrogen atoms in solid solution.

Other effects, similar to those which have just been described, have
been reported in the literature. KOSTER 99 studied an unexpected
decrease in the modulus of elasticity and increase in damping capacity
with the introduction into several metals of solute atoms of small
solubility. It was found that the anomalous behaviour was due to small
amounts of oxygen held in the copper and silver being studied. KOSTER
decided that, from knowledge of the nature of damping, the effect can
be ascribed to stress-induced diffusion of the solute atoms. Later,
ZENER 100 was able to show that the presence of residual elastic strain
energy is associated with a reduction of the elastic moduli, in agreement
with KOSTER'S observations. It is proposed that, as a solute atom passes
over the potential energy barrier between two stable interstitial
positions, the lattice acquires additional strain energy which is
associated with a lowering of the elastic moduli. Two brief summaries of
papers by MARX 101 and KOEHLER 102 add to the information available on
the effect of hydrogen in copper. In each case the hydrogen has a marked
effect on the elastic properties of the metal.

Some studies have been carried out on the effect of gases on the
photoelectric work-function of some metals. When such metals as thorium,
uranium, calcium, caesium, and barium react with oxygen, the photoelectric
threshold is shifted toward a longer wavelength. RENTSCHLER and HENRY 103
first studied the metals just mentioned and later 104 studied zirconium,
titanium, hafnium, and thorium. In the latter paper it was pointed
out that when oxygen, for instance, is present as an oxide at the
grain boundary of a metal, a sputtered film from this metal forms a
surface from which the true photoelectric threshold characteristic of
the pure metal may be measured. It is also to be expected that, if the
gas is in solution in the metal, surfaces sputtered from different batches
of the metal will yield different values of the photoelectric sensitivity,
due to variations in the gas content of the different batches. RENTSCHLER
and HENRY predicted that this would be more a difference in
relative sensitivity to different wavelengths than in a given threshold
wavelength. It was, in fact, found that, unlike the other metals
mentioned above, different photo-cells of zirconium and titanium responded
differently to various wavelengths even though the photoelectric
threshold was the same.

Fig. 4. Photoelectric activity of zirconium (RENTSCHLER and HENRY 104).

RENTSCHLER and HENRY 104 were able to show the effects of various
treatments with gas on the photoelectric response of zirconium. When
a small amount of oxygen reacted with the active metal surface the
threshold wavelength increased. Fig. 4 shows the photoelectric activity
of zirconium in different conditions. Curve 1 is the response of pure
zirconium metal. Curve 2 is the response from a surface sputtered from
zirconium previously heated to 400°C in hydrogen. Curves 3 and 4
show the response from surfaces sputtered from zirconium heated in
nitrogen and oxygen respectively. It was found that the response from
a surface sputtered from an oxygen-bearing wire does not depend on the
time of sputtering. Thus the lower work function is independent of the
film thickness provided the sputtering is done from metal with the gas
uniformly dissolved throughout the metal. Also the threshold wavelength
was found to be independent of the oxygen content (provided
there was some in solution), but the shape of the response curve is
dependent on the oxygen content. These facts were interpreted to
mean that the film consists partly of pure metal with no gas, and partly
of metal containing gas in solution, the former having a low threshold
and the latter a higher threshold wavelength.

Fig. 5. Photoelectric activity of titanium (RENTSCHLER and HENRY 104).

Fig. 5 shows the corresponding behaviour for titanium, where the effect
of the dissolved gases is even more marked than for zirconium. Curve 1
represents the photoelectric response for pure titanium; Curve 2 is
the response for a solution of nitrogen in titanium; and Curve 3 the
response for oxygen-bearing titanium.

In an interesting, but only qualitative experiment, RENTSCHLER and 
HENRY found that titanium with oxygen in solution is not as volatile 
as the pure metal. This effect has also been found with zirconium. 

It was found that thorium behaves like zirconium and titanium but 
only in so far as the gas increases the threshold wavelength. Both 
oxygen- and nitrogen-treated surfaces have a response similar to that of 
Curve 2, Fig. 4, i.e. a response lower, at low wavelength, than that of the 
pure metal. Since the thorium behaved like zirconium and titanium 
where the gas has reacted only with the surface of the metal, it was 
assumed that the gas does not dissolve as readily throughout the 
thorium. No explanation was given of the reason for this effect. 

Recently, ANDERSON 105 has studied the work functions and ageing 
characteristics of copper. The work functions of fourteen surfaces 
were measured by means of contact differences of potential with respect 
to barium surfaces of a known work function. The surfaces were pre- 
pared from spectroscopically pure copper which had been subjected to 
40 vacuum fusions and fractional distillations. The result of some 200 
measurements showed that the first films to be deposited were not 
reproducible, and drifted rapidly with time to a lower work function. 
Later deposits were more reproducible, and the rate and magnitude of 
drift decreased. For the specimens 9-14 there was good reproducibility 
and no drift. At no time was there any drift to a higher work function. 
The variability is due to dissolved gases which are retained by the 
copper despite vacuum fusions, and are reabsorbed into the condensed 
film. Since the gas, presumed to be oxygen, lowers the work function, it 
is concluded that the gas is in solution and not merely adsorbed on the 
copper. This conclusion is reached since oxygen adsorbed on a clean 
tungsten surface raises the work function, and copper would be expected 
to behave similarly if it had an adsorbed gas film. However, it must be 
emphasized that there was no proof of the presence of oxygen in the 
copper. FAST 58 agrees that the adsorption of oxygen causes an increase 
in the work function. 

A number of papers on the influence of hydrogen on photoelectric 
sensitivity have been listed by SMITH. 2 Except for those reported in 
this paper there appears to be no recent work on this subject. 

Lattice Expansion 

Recent work on the lattice expansion of metals due to the presence of 
dissolved gas in the lattice appears to be almost non-existent. A 
relatively recent paper, which has some bearing on the subject, is by
HUME-ROTHERY and ANDREWS. 106 In studying the lattice spacing and
thermal expansion of copper, they accidentally oxidized some copper
at 771°C. This metal was cooled to room temperature, reheated to
870°C for one hour and again cooled to room temperature. X-ray
analysis of this showed oxide lines, but the copper lattice spacing was
the same as that of pure copper. At higher temperatures the lattice
spacing of the oxidized metal was only slightly higher than that of the
pure metal, the difference at 770°C being 0.0004 Å. No attempt was
made to examine equilibrium conditions. 

A large part of the available information on metals which dissolve only 
small amounts of gas is that for electrodeposited chromium. HUTTIG 
and BRODKORB 107 report that it has a body-centred cubic lattice which 
has a larger parameter than that of chromium which has been melted, 
and this was taken to indicate the entry of hydrogen into solid solution. 
WOOD 108 later found, along with a marked line broadening, a very slight
displacement of the lines of a diffraction pattern. He concluded that
a small part of the hydrogen in chromium is in solid solution, but that
the major portion is adsorbed on the walls of "crystallites" which cause
the diffuseness of the diffraction lines. HUME-ROTHERY and WYLLIE 109
confirmed the formation of a body-centred cubic structure, and recog- 
nized the possibility of "abnormal lattice spacings due to co-deposited 
hydrogen" but did not study the effect in detail. 

Recent work has been done on electrolytic chromium by POTTER and
LUKENS, 110 who measured the effect of hydrogen on the lattice parameter.
As in other cases, the electrodeposited metal gave a normal body-centred
cubic diffraction pattern with the lattice parameter somewhat enlarged
compared to gas-free metal, and with a broadening of the lines. This
was taken to indicate an enlargement of the lattice with variations in
the lattice parameter of 0.3 to 0.4 per cent, causing the line broadening.
On degassing at 500°C, the parameter decreased somewhat and the lines
were less diffuse. The sample degassed at 1000°C showed a normal lattice
with sharp lines of almost the correct spacing. Similar effects had been
observed with manganese 111 on degassing at 300°C and 500°C,
respectively.

In discussing this paper, SNAVELY briefly described a theory which 
he had developed from experimental evidence reported in another 
recent publication. 112 It was suggested that unstable chromium 
hydrides were deposited, with the hydrogen atom in interstitial positions 
forming either a wurtzite- or fluorite-type structure with the chromium, 
depending on the hydrogen content. Since these hydrides are unstable 
at ordinary temperatures they dissociate rapidly after deposition to 
form atomic hydrogen and body-centred cubic chromium. A large 
volume change (15 per cent) accompanying this reaction is believed to 
be related to the hardness, brittleness, and cracking characteristics of
this type of electrodeposit. "Microfissures" are formed during the
volume change and the hydrogen, which has a very low solubility in the
chromium lattice, is "occluded" in the fissures. These fissures are
apparently the cause of the line broadening in the diffraction patterns.
POTTER and LUKENS had suggested the possibility of a supersaturated
interstitial solution of hydrogen in chromium. However, SNAVELY
pointed out that the body-centred cubic chromium lattice has interstices
which are about 1.58 Å in least diameter, while the hydrogen atom is
less than 1 Å in diameter. SNAVELY 110, 112 considered that "an interstitial
atom must be sufficiently large to contact all surrounding atoms if a
stable or even metastable interstitial solid solution" is to form, and,
therefore, the possibility of chromium absorbing hydrogen is small.

SMITH 2 reports work by two investigators who found lattice disten- 
sions in iron and nickel which had been electrodeposited from solutions 
containing arsenic. SMITH claims that the observed increase in para- 
meter, accompanied also by line broadening, is due to the arsenic. 

It must be appreciated that the gas content of an electrodeposited 
metal is not an equilibrium gas content, and may cause strains not 
present in an annealed metal. It seems that no lattice parameter 
increase has been found in those metals which dissolve only small 
amounts of hydrogen when in equilibrium with the gas phase. This is 
really to be expected if one considers the size of the hydrogen atom com- 
pared to the relatively large interstitial space available for the small 
number of hydrogen atoms absorbed. However, SMITH 2 uses this 
absence of true lattice expansion as an argument in favour of his "rift" 
hypothesis. 

POTTER and HUBER 111 have studied the effect of hydrogen on the
lattice parameter of electrolytic manganese. It was found that the
manganese lattice is distended, the average lattice parameter enlarge-
ment being about 0.0003 per cent per c.c. of gas per 100 g of metal. A
widening of the lines in the x-ray diffraction patterns of gas-bearing
specimens indicated a deviation from the average parameter caused by
non-uniform distortion of the lattice. This non-uniform lattice was
attributed to non-uniform gas distribution. The effect of various
degassing treatments on the metal indicated that the parameters were
not affected unless the metal was heated under a reduced pressure at over
300°C. At 500°C and over, the lattice returned to its normal size and
the formerly broadened diffraction lines became sharp. It is pointed out
that although the hydrogen in solution is only 0.02 per cent by weight,
under normal conditions it would occupy a volume twenty times that of
the metal containing it. The authors state further: "It is not unreason-
able to expect that this highly compressed gas would have some effect
on the structure of the metal containing it." The concept of this
dissolved gas being merely "highly compressed gas" is hardly in keeping
with other information. The non-uniformity in the lattice parameters 
mentioned above is not proportional to the amount of gas in the metal. 
The deviation from the mean lattice parameter (line broadening) is 
zero at zero gas content, high at 95 c.c. per 100 g of metal, and decreases 
with increasing gas content up to the observed gas content of 615 c.c. 
per 100 g of metal. It is shown that this indicates that the deviation 
might again be zero at a value of about 695 c.c. hydrogen per 100 g 
manganese, when the metal unit cell would contain exactly 2 gas atoms. 
Since there was some diffuseness in the lines at 350 c.c. of hydrogen 
per 100 g of manganese it was concluded that the average distribution of 
1 atom of gas per unit cell was not uniform and that the cells could 
take two atoms each. 
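
The correspondence between gas content and gas atoms per unit cell can be checked with a short calculation. The sketch below rests on assumptions that are not stated in the text: that the gas volumes refer to 0°C and 1 atmosphere (22,414 c.c. per mole), that each hydrogen molecule supplies two atoms, and that the unit cell is that of alpha-manganese with 58 metal atoms. The function name is hypothetical.

```python
MOLAR_VOLUME_CC = 22414.0    # c.c. per mole of ideal gas at 0 C, 1 atm (assumed basis)
MN_ATOMIC_WEIGHT = 54.94     # g per mole of manganese
MN_ATOMS_PER_CELL = 58       # assumption: alpha-manganese unit cell


def hydrogen_atoms_per_cell(cc_per_100g, atoms_per_cell=MN_ATOMS_PER_CELL):
    """Average number of hydrogen atoms per unit cell of the metal."""
    moles_h_atoms = 2.0 * cc_per_100g / MOLAR_VOLUME_CC   # two atoms per H2 molecule
    moles_metal = 100.0 / MN_ATOMIC_WEIGHT
    return atoms_per_cell * moles_h_atoms / moles_metal


for gas_content in (95.0, 350.0, 615.0, 695.0):
    n = hydrogen_atoms_per_cell(gas_content)
    print(f"{gas_content:5.0f} c.c. per 100 g: {n:.2f} H atoms per cell")
```

Under these assumptions the gas contents of 350 and 695 c.c. per 100 g come out close to one and two hydrogen atoms per cell respectively, which is consistent with the figures quoted above.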

In another paper 113 it is inferred on the basis of related phenomena 
described in the literature, that, owing to the absorption of hydrogen 
by nickel, systematic lattice defects arise in the electrolytically deposited 
metal. These defects amount locally to about a 13 per cent change in 
the lattice constant. These local lattice distortions cause the fine- 
grained structure and hardness of the deposit. In another experimental 
study 114 it is shown that hydrogen participates directly in the formation 
of the structure of the electrodeposit of nickel. One other abstract 
refers to a paper 115 which describes an experimental spectrographic 
study of the state of hydrogen in a metal, the action of x-rays and the 
structure of the lines of the Kα1-α2 doublet. It is claimed that hydrogen
in the form of protons is a factor in determining the structure of the 
doublet and also the properties of the material. 

Considering those metals which dissolve larger amounts of hydrogen,
SMITH 2 in various parts of his monograph discusses the effect of
hydrogen on the metal lattice. Most of the experimental work has been
done on palladium, but information is given for other metals which
absorb hydrogen readily. Representative of the studies on palladium
is the work of OWEN and JONES 116, 117 which will be summarized here.
By making measurements of the palladium lattice parameter under
isothermal conditions and at varying pressures, the hysteresis effect
when the pressure of the gas is taken through a cycle was determined.
Fig. 6 shows the results which were obtained at 100°C. Those parts of
the isothermals in which only a single component existed could be
reproduced by simply altering the pressure. This reproducibility
disappeared, however, when a second phase was present.

It was found that the hydrogen solubility changed rapidly when the
beta phase appeared or disappeared. The measurements made are
interpreted as meaning that the system consists of two solid solutions
at the temperatures considered, namely 100 and 120°C. The hydrogen-
rich beta phase is closely associated with the combination Pd₂H which
is capable of dissolving more hydrogen. The beta phase was always
distorted no matter what precautions were taken to obtain it without
distortion. It had been suggested earlier that this phase had an
orthorhombic lattice, but this was not confirmed. It was found too that
the distortion of the lattice remained after removal of the hydrogen.
OWEN and JONES state that, "The presence of the hydrogen, even
though it enters the palladium lattice interstitially, causes a permanent
displacement of the palladium atoms which if the concentration is high
enough, cannot be removed without subjecting the material to a
temperature well above that required to remove the hydrogen from the
metal. This would suggest that at certain concentrations the gas may
not be in its normal state when it leaves the metal."

Fig. 6. Effect of hydrogen pressure on lattice parameter of palladium
(OWEN and JONES 116).

Recently CLARK 118 has reported a new determination of the lattice
constants of alpha (hexagonal close-packed) titanium. Comparing his
results with earlier work, CLARK found agreement with FAST'S work but
not with others. CLARK is able to show how the c value is markedly
increased by oxygen and nitrogen which form interstitial solid solutions,
while the a value remains essentially unchanged. Since the only points
of disagreement with earlier work were in the c values, CLARK concluded
that the other materials tested must have been contaminated with
oxygen or nitrogen. Fig. 7 shows CLARK'S results indicating the effect
of the two gases on the lattice.

BUSK and BOBALEK 119 found a similar effect in the case of a
magnesium + 10 per cent aluminium alloy charged with hydrogen in
an alkaline solution. In this case both parameters had increased
significantly, and it was concluded that the hydrogen had dissolved
interstitially.

The most recent discussion on the lattice parameter has been pre- 
sented by SMITH. 68 X-ray evidence of so-called "rifts," or widened 
lattice spacings in work-hardened metal, as compared with the relatively 
continuous and uniform lattice of fully annealed metal is discussed in 
connexion with hydrogen solubility. SMITH claims that the charac- 
teristic line-broadening associated with the absorption of hydrogen is 
due to widened lattice spacings, caused initially by plastic deformation, 
but which may be extended by hydrogen absorption. The development, 
manifestations, and effects of the "rifts" are described in this paper. 68 

Fig. 7. The effect of the addition of small amounts of oxygen and
nitrogen on the lattice parameter of alpha titanium (CLARK 118).

Studies of the relationships between lattice parameter, hardness, 
resistivity, and volume changes have been carried on for a number of 
years by a French group. PORTEVIN, CHAUDRON and MOREAU 120
showed that the electrodeposited metals are much harder, have lower 
conductivity and have broader x-ray diffraction lines than annealed 
metals. By using the cold bombardment method for degassing, it was 
shown that the removal of hydrogen from electrodeposited iron altered 
neither the hardness, the lattice parameter, nor the diffuseness of the 
lines. Annealing after cold degassing caused no appreciable removal of 
gas, but did cause a decrease in parameter, a sharpening of the x-ray 
lines, and the expected softening of the metal. A special type of cold- 
working by the hydrogen atoms was postulated, the metal remaining in 
a strained condition after removal of the gas. Cathodically charged 
Armco iron was found to have a decreased tensile elongation and 
increased resistivity. Similar changes were found in the properties of 
aluminium after the same treatment. 

In studying the palladium-hydrogen system, MICHEL 121 found that,
on changing the composition from pure Pd to Pd₄H by the addition of
hydrogen, the Brinell hardness increased from 47 to 170, while the
lattice parameter increased from 3.882 to 3.885 Å, the hydrogen atoms
occupying the body-centred positions in a face-centred cubic palladium
lattice. The beta phase, PdH, was found to have a Brinell hardness of
only 70. On heating PdH at 310°C, hydrogen was slowly evolved and the
hardness rose to 170 Brinell after about 2 hours. After that time the
alloy with a composition corresponding to Pd₄H had a hardness which
slowly fell to a value limited by the annealing temperature. When the
PdH was degassed by cold bombardment, the lattice remained distorted
and the hardness was unaffected.

The results which had previously been obtained for iron and aluminium
were reproduced in the case of palladium and tantalum by
CHAUDRON, PORTEVIN and MOREAU. 122 The results obtained for palladium
were as follows:



                                             Ratio of        Lattice     Brinell
Composition                                  Resistivities   Parameter   Hardness
                                                             (Å)

Pure annealed Pd                             1               3.882       45
Beta phase                                   1.69            4.020       77
Beta phase degassed (ionic bombardment)      1.69            4.020       77
Beta phase degassed and annealed at 600°C    1               3.882       45
Alpha phase                                  1.45            3.885       170
Alpha phase degassed (ionic bombardment)     1.45            3.885       170
Alpha phase degassed and annealed at 600°C   1               3.882       45











MICHEL, BENARD and CHAUDRON 123 followed the changes of the 
lattice parameter and diffuseness of the x-ray diffraction lines as 
palladium was electrolytically charged with hydrogen. Since the beta 
phase appeared as soon as hydrogenation started, and from subsequent 
observations, it was concluded that a superficial layer of the beta phase 
is formed and the hydrogen diffuses from this phase into the metal to 
form the alpha solution. When the inner portion of the metal has 
reached a composition corresponding to Pd₄H, then the beta phase
starts to advance into the body of the metal by diffusion of an excess of
hydrogen over that amount required for the alpha phase. On exposure
to air, hydrogen was found to leave the beta phase, the lattice para-
meter decreasing to a minimum value of 4.01 Å (at room temperature).
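
To give a sense of scale to the lattice parameters quoted for palladium, the following sketch converts them into percentage linear and volume expansions relative to pure annealed palladium. It assumes only that the cell remains cubic, so that the volume varies as the cube of the parameter; the function names are hypothetical.

```python
A_PURE_PD = 3.882  # lattice parameter of pure annealed palladium, in angstroms


def linear_expansion_percent(a, a0=A_PURE_PD):
    """Percentage increase in lattice parameter relative to a0."""
    return 100.0 * (a - a0) / a0


def volume_expansion_percent(a, a0=A_PURE_PD):
    """Percentage increase in cell volume for a cubic cell."""
    return 100.0 * ((a / a0) ** 3 - 1.0)


cases = [
    ("alpha phase", 3.885),
    ("beta phase", 4.020),
    ("beta phase after exposure to air", 4.01),
]
for label, a in cases:
    print(f"{label}: linear {linear_expansion_percent(a):.2f} per cent, "
          f"volume {volume_expansion_percent(a):.2f} per cent")
```

On this basis the alpha phase differs from the pure metal by less than 0.1 per cent in parameter, whereas the beta phase is expanded by several per cent, which makes clear why the two phases are so readily distinguished by x-ray measurements.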

Finally MICHEL 124 carried out a thermal investigation on the hydrogen-
palladium system by dilatometric and differential analysis. The
results obtained were interpreted as being in agreement with the
results of the previous x-ray analysis. The diagrams in Fig. 8, taken
from MICHEL'S paper, show the dilatometric curves for both the alpha
and beta phases. On heating in nitrogen, the alpha phase starts to
contract at about 200°C, and the beta phase starts at about 100°C.
In each case the contraction is complete at 300°C. This is shown in
parts a and b of Fig. 8. The differential thermal analysis of the beta
phase, heated in nitrogen, is shown in part d. Starting at room tempera-
ture there is an absorption of heat due to the removal of hydrogen,
and this is followed by a marked exothermic effect due to the contrac-
tion of the lattice at about 300°C. The same experiment repeated on a
specimen which has been degassed by ionic bombardment will yield
the curve shown in part e. The exothermic effect persists, but no heat
is required to remove the hydrogen.

Fig. 8. Thermal and dilatometric relationships in the palladium-hydrogen
system (MICHEL 121).

PORTEVIN 125 briefly expressed the same idea, that the distortion of
the crystal lattice persists after degassing because of a "cold-working"
effect by the gas atoms. The temporary introduction of gas atoms may
be used to modify the resistivity of the metal.

Magnetic Properties 

Although the present author has found no report of any effect of
oxygen or nitrogen on the magnetic properties of metals, the effect of
hydrogen is well established for palladium and iron. It has been found
that the strong paramagnetism of palladium decreases, on absorbing
hydrogen, from the value of the susceptibility for pure palladium to a
final value which is reached at a concentration of H/Pd = 0.55 to 0.66.
The susceptibility remains approximately the same at concentrations
higher than the rather uncertain limiting composition. It should be
noted that the decrease in susceptibility is not due to the formation of
the beta phase, since the addition of a small amount of hydrogen will
lower the susceptibility prior to any possible beta formation. SMITH 2
has reported several explanations of this effect, the simplest probably
being that of MOTT and JONES 126 who concluded that the hydrogen
electron goes into the d shell, thereby decreasing the number of vacancies
in the palladium d shell and hence the paramagnetism. Since the
palladium atom has 0.55 holes per atom in the d shell, and since it had been
shown that the magnetic susceptibility of the palladium was reduced to
zero by the addition of about 0.55 hydrogen atoms to each palladium
atom, MOTT and JONES also concluded that the dissolved hydrogen
atom must be completely ionized. Since some investigators report the
H/Pd ratio to be as high as 0.66 when the loss of paramagnetism is
complete, SMITH 2 concludes that one-sixth of the hydrogen atoms remain
uncombined and are contained "either in the interstices of the distended
lattice of the beta phase or outside the lattice proper in the rifts. ..."
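The "one-sixth" quoted by SMITH follows directly from the two limiting compositions given above. A minimal sketch of the arithmetic is shown here. It assumes, as an illustration only, that each ionized hydrogen atom fills exactly one d-shell hole and that the susceptibility falls linearly with hydrogen content until the holes are exhausted; the function names are hypothetical.

```python
HOLES_PER_PD = 0.55  # d-shell holes per palladium atom (MOTT and JONES)


def relative_susceptibility(h_per_pd, holes=HOLES_PER_PD):
    """Fraction of the pure-Pd paramagnetism remaining at a given H/Pd.

    Assumes one hole is filled per hydrogen atom and a linear fall,
    with no further change once all the holes are filled.
    """
    return max(0.0, 1.0 - h_per_pd / holes)


def uncombined_fraction(h_per_pd_at_zero, holes=HOLES_PER_PD):
    """Fraction of the hydrogen that cannot have entered the d shell
    if paramagnetism only vanishes at the stated H/Pd ratio."""
    return (h_per_pd_at_zero - holes) / h_per_pd_at_zero


print(relative_susceptibility(0.30))  # partly filled d shell
print(uncombined_fraction(0.66))      # close to one-sixth
```

With 0.55 holes per atom and a limiting ratio of 0.66, the excess is 0.11 atoms of hydrogen per atom of palladium, which is one-sixth of the total.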

SIEVERTS and DANZ 127 have shown that the effect of deuterium on the 
magnetic susceptibility of palladium is exactly the same as that of 
hydrogen. Thus it may be assumed that the deuterium atom also yields 
its electron to the d shell of palladium thereby rendering it diamagnetic. 

VOGT 128 did experiments which indicate at least roughly that the
addition of gold to a palladium-hydrogen system causes a similar
decrease in the magnetic susceptibility as that brought about by the
hydrogen. This lowering of the susceptibility is complete at 50 to 60
atomic per cent gold, and at this composition the hydrogen was found
to be insoluble. VOGT concluded that gold and hydrogen have equal
effects on the paramagnetism of palladium.

The most recent information which the writer has been able to find
in the literature is a paper by FITZWILLIAM, KAUFMANN and SQUIRE 129
who studied the effect of hydrogen on the magnetic properties of
titanium and zirconium. An interesting point is that, on absorbing
hydrogen, the paramagnetic susceptibility of titanium increased. The
reason for this was not explained. Only one composition (1.2 atoms of
hydrogen per atom of titanium) was apparently considered. In the
original zirconium specimen, an iron content of 0.11 per cent played
no part in the magnetic susceptibility except at liquid hydrogen
temperatures. However, on addition of hydrogen and with suitable heat
treatment, "the iron appeared to precipitate out of solution with the
zirconium in that it gave a ferromagnetic component to the
magnetization." It is believed that the iron must have actually formed clusters in
order to cause the observed effects. However, the authors were able to
separate the effect due to hydrogen alone, and that due to iron, on the
susceptibility and they have presented a curve showing the
paramagnetic susceptibility of zirconium as a function of hydrogen
concentration. The curve is similar to that for the palladium-hydrogen system but
differs in that, instead of being a linear decrease, the curve is concave
toward the axes, and a constant value of susceptibility is reached at a
positive, rather than a diamagnetic, value as shown in Fig. 9. It is
interesting to note that the Curie point of the precipitated iron was
found at 585°K, instead of near 1000°K, as in the lattice of pure iron.
Other experiments relating structure, temperature, hydrogen content,
and susceptibility were also described.

Fig. 9. Magnetic susceptibility of the system Zr-H as a function of
hydrogen concentration at 295°K (FITZWILLIAM, KAUFMANN and SQUIRE 129).

The effect of interatomic distances on susceptibility has been applied
by MICHEL in an attempt to explain the magnetic behaviour of the
palladium-hydrogen system. In 1939 MICHEL and GALLISSOT, 130 using
the cold-bombardment method of degassing developed by CHAUDRON
and MOREAU, determined the respective effects of hydrogen and
interatomic distance in the variation of the susceptibility of palladium.
The experimental results are given in Table 3 on page 135. These
results seem to show that the magnetic susceptibility is unaffected by
degassing. The authors attribute the large variations in susceptibility
to the modification of the interatomic distances by the dissolved
hydrogen.

[Table 3, surviving fragments only: row "PdH degassed" ; R = electrical
resistance ; R0 = resistance of pure Pd.]

The effect of hydrogen on the magnetic properties of those metals 
which absorb only small amounts of the gas is not so well explained as 
in the systems just discussed. SMITH 2 has reviewed the effects of 
hydrogen on the magnetic properties of iron and manganese, and 
concludes that the influence of hydrogen is probably due to its effect in 
cold-working the metal and also its purifying effect. It is indicated 2 that 
the susceptibility (and permeability) of iron at low field strength (about 
10 oersteds) increases with decreasing hydrogen content, in much the 
same way as in the palladium-hydrogen system.

Extremely high permeabilities have been reported for iron annealed
near 1500°C in wet hydrogen for 30 minutes, and then cooled rapidly to
a temperature just below the A3-point of pure iron, and held there for
some time. The author attributed the high permeability to the high
purity of the metal, due to the hydrogen treatment, and not to any 
residual hydrogen content. SMITH 2 claims that the effect is due to the 
annealing only, the atoms (of metal) being "no longer in the altered 
states in which they presumably exist in the vicinity of a rift." WALL 131 
confirmed the improvement in the magnetic properties of iron by the 
hydrogen treatment, and found similar improvements in the case of 
cobalt and nickel. 

WHEELER 132 found that the absorption of hydrogen by manganese
lowered the susceptibility of alpha-manganese at 450°C, and
beta-manganese at 850°C, but above 1200°C raised that of face-centred
tetragonal or molten Mn. At no time was the metal reported to
become ferromagnetic.

SMITH 2 concludes that the modification of the magnetic properties of
those metals which do not dissolve large amounts of hydrogen must be
due to some cold-working of the metal. At the same time it is
recognized that a similar cold-working should take place in Pd, and this
would be expected to alter the linear dependence so well explained by
the electron theory of metals. This anomaly does not seem to have
been explained.

Finally, SMITH 2 has attempted to show a reciprocal relationship
between paramagnetic properties and capacity to absorb hydrogen. It
is stated that all highly magnetic metals absorb measurable amounts
of hydrogen, while elements with χ less than 0.8 × 10⁻⁶ do not, with
five exceptions, absorb hydrogen. SMITH has proposed that the relation
between paramagnetism and hydrogen content may be due to some
process which precedes a final arrangement. This process is postulated
to be the "penetration of the metal" which factor is determined by "the
rift-state." It is suggested that the "rifts" cause "changed electronic
configurations" which in turn alter the susceptibility. Earlier, in
explaining the sudden decrease in the magnetic permeability of iron
after being cathodically charged with hydrogen for some time,
SMITH said :

"Since saturation is that concentration of a phase which is in
equilibrium with a certain pressure, we shall be making little more than a
change of terminology if we suppose the critical stage to be that at
which the pressure of the rift hydrogen, steadily building up toward
the high effective pressure of electrolysis at high current density,
reaches a value at which the lattice lets go and sudden enlargement of
the whole rift system takes place, the resulting increase in
interatomic spacings causing alterations of electron configurations in the
atoms, which are, in turn, perhaps responsible for the changes in
magnetic properties."

The present writer finds it very difficult to visualize a metallic 
lattice, in the so-called elastic range, "letting go". 

SEEMAN 133 repeated some experiments done earlier by SHIMIZU on
the effect of gas on the susceptibility and electrical conductivity of
gold-silver alloys before and after heating in vacuo, heating in nitrogen,
and remelting in vacuo. The above treatments had no effect on the
magnetic properties of either the alloys or the pure metals. SEEMAN
stated that the effect of oxygen on the susceptibility of diamagnetic
and paramagnetic alloys is still not accurately known, but for precise
determinations complete degassing of the specimens was recommended.

MANN 134 has stated that dissolved gas has a marked influence on the
value of the magnetism and, therefore, the alloys (the thallium-lead 
system was used as an illustration) must be rigorously annealed in
vacuo after each measurement.

Electrical Resistivity 

The present writer has been able to find very little information
describing the effect of dissolved oxygen or nitrogen on the electrical
resistivity of a pure metal. For those metals which dissolve only small
amounts of hydrogen there appears to be no recent experimental
evidence to show a definite effect. SMITH 2 has briefly reviewed the
literature available on this subject up to 1946 and the reader is referred
to SMITH'S supplementary notes (pp. 258, 262, 265, 267, 268, 275) for
other references. Although the evidence seems to be rather uncertain,
and the experimental work was done many years ago, it seems that the
resistivity of iron will increase a small amount on absorbing hydrogen
from the gas phase. If the iron is charged with hydrogen cathodically
then the resistance increases up to a limiting hydrogen content
and thereafter starts to decrease. This effect is similar to that which
occurs when palladium absorbs hydrogen, and will be described later.
When nickel absorbs hydrogen from the gas phase, the experimental results
variously indicate (a) an increase, (b) a decrease, and (c) no effect
on the resistivity. When the nickel is cathodically
charged with hydrogen the effect is also doubtful. One report has
indicated a decrease in resistivity due to "protonic conduction." The
same remarks as have been made for nickel seem to apply also to the
case of platinum. No effect of any kind on the resistivity of copper and
silver has been found due to hydrogen.

As in the case of all other physical properties of the
palladium-hydrogen system, the electrical resistivity seems to have been studied
quite thoroughly. As in the case of other properties, SMITH 2, 135 has
used the experimental evidence in the case of palladium-hydrogen to
support his view that a large proportion of the hydrogen present in
palladium is concentrated in "rifts." His arguments will not be
presented here, but in preference to them the writer will review some
recent work by WRIGHT 136 which is in agreement with the earlier x-ray
work discussed by OWEN. 137 It is pointed out that the exceptional
properties of the transition metals are due to the partly filled d shell.
It was stated in the discussion of magnetic susceptibility that palladium
has approximately 0.6 unoccupied states per atom in the d band.
Generally it is accepted that the hydrogen dissolved in palladium is
almost completely ionized, and its electrons enter this partially filled
d band. Thus the palladium d shell is filled when the H/Pd ratio is 0.6.
This seems to be true for palladium in equilibrium with the gas phase,
but SMITH 2 has shown several cases where the H/Pd ratio is higher than
0.6 when the palladium is electrolytically charged with hydrogen.
This has been explained by SMITH to be due to hydrogen being held in
the widened lattice spacings of the metal lattice. WAGNER, 138 in
calculating the hydrogen solubility from thermodynamic principles, has
assumed that this excess hydrogen is located interstitially in solid
solution, the electrons entering the s band of the palladium. WRIGHT
has presented some theoretical ideas proposed by LACHER and WAGNER
to account for the form of the concentration-pressure isotherms of
hydrogen in palladium. It was concluded by WAGNER that the existence
of a hysteresis loop is due to a failure to establish thermodynamic
equilibrium. The relatively low temperatures of observation allowed
only restricted exchange of palladium atoms between the two phases of
the palladium-hydrogen system. Experimental studies of the hysteresis
effect in the palladium-hydrogen system indicated that reversible
isotherms are obtainable only by using palladium in a finely divided state.
Quite contrary to the definite opinions of SMITH 2, 135, WRIGHT has
interpreted the work of SIEVERTS and his co-workers, 24, 26, 139, 140 and
of OWEN and his co-workers, 116, 137, 141 to indicate that "the mixed
phase or (alpha + beta) regions within the hysteresis loop are
characteristic of a system in which the thermodynamic equilibrium is not
completely established. The x-ray studies also indicate that the new
phase, alpha or beta, exists finely dispersed throughout the old phase,
the establishment of a new phase occurring, presumably, by the process
of nucleation and growth." LACHER'S formula, which explains the
hysteresis effect, is based on assumptions which are supported by the
above conclusions. SMITH has claimed these assumptions to be invalid in
the light of experimental evidence which appears to indicate that the
two phases of the palladium-hydrogen system cannot exist
interdispersed. WRIGHT'S experiments have definitely shown that the
(alpha + beta) phase does exist over a small range of pressure.

WRIGHT determined resistance isotherms at several
temperatures, allowing sufficient time after each pressure change for
the resistance to become almost constant. The results of his
ments are shown in Fig. 10. It was shown that the limits of the 
hysteresis loops were quite close to the corresponding limits for the 
occlusion isotherms. The transitions from one phase to another have 
been described on the basis of x-ray, solubility, and electrical resis- 
tance measurements. If one accepts that the nuclei of a new phase 
are dispersed throughout the old phase, then the hysteresis effect may 
be explained by the contribution of the surface between the two 
phases to the free energy of the system. WRIGHT has described this
mechanism :

"The vapour pressure of the dispersed beta-phase particles would be
greater than the pressure they would exert in bulk, hence they could
not form without some degree of supersaturation in the alpha phase.
Similarly during evolution of hydrogen, an undersaturated beta phase
would be necessary for the formation of alpha-phase nuclei. Over a
certain range of pressure up to the point where the isotherms rise (or
fall) sharply, it must be assumed that the two phases co-exist in a
state of metastable equilibrium. Beyond this stage, the degree of
supersaturation in the alpha phase (or undersaturation in the beta
phase) is such that the system becomes unstable and the new phase is
rapidly formed in fairly large blocks out of the finely dispersed
particles."

Fig. 11. Concentration-pressure isotherms at 100°C.
Experimental : SIEVERTS and DANZ.
Theoretical : I. LACHER, equilibrium ; II. WAGNER, path followed between
upper and lower isotherms (WRIGHT 136).

The role of the hydrogen in each of the stages of the 100°C hysteresis
loop shown in Fig. 10 has been clearly outlined by WRIGHT. The
hydrogen in solid solution in the lattice will ionize, and the proton or
hydrogen ion will increase the resistance by disturbing the periodicity
of the lattice, while the electron will enter the unfilled d band of the
palladium. These are opposing effects, the former tending to increase
the resistivity, and the latter to decrease the resistivity. From the
close correspondence between the concentration isotherm (Fig. 11) and
the resistance isotherm, WRIGHT has deduced that the scattering
centres provided by the hydrogen ions have a greater effect and,
therefore, increase the resistance of the palladium-hydrogen system. It was
also indicated that the increase in resistance due to cold working is
small compared with this alloying effect, so that lattice distortion at the
phase boundaries will be relatively unimportant in altering the
resistance. Finally, the effect on the resistivity of a new phase in the system
will be greatest when the phase particles are of comparable dimensions
to the wavelength of the conduction electrons.

In the first stage of the resistance-pressure isotherm at 100°C, the
relatively greater effect due to decreased periodicity of the lattice
caused by the hydrogen ions entering the metal, over the effect of the
electrons entering the d shell, will result in an increase in the resistivity.
When the beta phase starts to form at about 27 cm Hg, there is a slight
increase of resistance due to strains at the phase boundaries. At about
31 cm Hg, the system transforms completely to the beta phase. If this
transformation were brought about by very small increases in hydrogen
pressure with alternate long time periods allowed for equilibrium
conditions to be reached, the vertical portion of the curve would become
almost parallel to the resistance axis. In the beta phase, the d shell is
filled and an increase in pressure appears to cause no further marked
change in the resistivity up to normal pressure. By comparing Fig. 10
with Fig. 11, one sees that there is an increase in resistance as the
hydrogen content is decreased, i.e. as the pressure over the beta phase
is lowered from normal pressure to 17 cm Hg. Since the alpha phase
does not form until the pressure is decreased to about 19 cm Hg, there
can be no increase in resistance due to strains, but WRIGHT has
attributed it to an increase in the number of positive holes in the d band of
palladium. As the pressure is decreased to 17 cm Hg, the system changes
to the alpha phase. With the removal of hydrogen the resistance drops,
since the fall due to reduced hydrogen scattering is a more important
factor than the removal of electrons from the d shell which tends to
increase resistance. Under ideal conditions this decrease in resistance
would be shown as a line parallel to the resistance axis. This seems to
the present writer to be a reasonable explanation of the phenomena
which have been observed.

Some time ago ANDREWS 142 used electrical resistance measurements
in conjunction with pressure-volume relationships to study the reaction
of nitrogen, oxygen and hydrogen with tantalum. Fig. 12 shows the
variation of resistance at room temperature with gas content of the
metal. Unfortunately, no theory was suggested to explain these curves.
It was found that in nitrogen, absorption increased with increased gas
pressure until about 100 relative volumes were absorbed. At this point
the equilibrium pressure dropped as absorption increased, although this
had no effect on the resistivity. It was noted that the resistance at
room temperature rises less rapidly with hydrogen concentration than
for either oxygen or nitrogen. The reason given was : "It can be
assumed that this is due to the small atomic volume of the hydrogen,
so that a given number of molecules dissolved in the metal would strain
the lattice less than larger atoms." Probably ANDREWS meant that a
given number of atoms of hydrogen would strain the lattice less than
oxygen or nitrogen. It is now generally accepted that the dissolved
gas is not present as molecules. The sudden change in slope of the
oxygen curve was attributed to the formation of an oxide compound,
but no further explanation was given.

Later, WEBBER, 143 basing his experimental treatment on the results of
ANDREWS' work, showed that gas content (particularly oxygen and
nitrogen since the specimens were annealed in air) is not a major
factor affecting the superconductivity of tantalum. In this case the
gases were absorbed from the gas phase.

In more recent work GOLIK, LAZAREV and KHOTKEVICH 144 have found that
the temperature range of supraconductive transition is broadened
as a result of electrolytically charging tantalum with hydrogen. The
broadening increases with the amount of hydrogen in the tantalum. At any
given temperature between 1.85 and 4.2°K, the electrical resistance
was found to increase with the amount of occluded
hydrogen. At temperatures down to 1.86°K, it was found that
tantalum specimens could be made to lose supraconductivity altogether
by saturation with hydrogen. The upper limit of the transition
range remained unchanged, all samples showing a marked decrease
in resistivity at that point. The condition of the specimens appeared
to control the amount of hydrogen which could be occluded. For
example, 0.340 mg of hydrogen was required to suppress
supraconductivity in a new tantalum sample. This wire was heated 10 hours at
1700°C in vacuo, and its supraconductivity was completely restored.
After this anneal, only 0.005 mg of hydrogen was required to suppress
supraconductivity. In contrast to tantalum, it was observed that
the supraconductivity of niobium was held at 4.2°K after charging
with hydrogen. It was concluded that this effect of hydrogen on
tantalum could not be due to simple expansion of the lattice, but must
be due to the formation of solid-solution alloys.

DRAVNIEKS 145 used the change in electrical resistance as a means of
measuring the rates of reaction of zirconium and copper with both
ionized and non-ionized gases. The study was mainly one of surface
reactions such as oxidation. The resistance changes noted in the
copper specimens were taken to be caused by the formation of a cuprous
oxide film which is practically non-conducting compared with copper.
The conductivity of the zirconium changed as the reaction with gas
proceeded due to : (a) the formation of a zirconium-gas solid solution
of lower conductivity than the pure zirconium, and (b) the formation of
an oxide scale of zirconium. The change of resistivity was used merely
as a measuring device, and again no theory was proposed to account
for the effect.

Recently JAFFEE and CAMPBELL 146 have reported that both oxygen and
nitrogen raise the resistivity of titanium to the same extent per
atomic per cent. Hydrogen up to one atomic per cent does not seem to
change the resistivity. The results obtained are shown in Fig. 13.

There seem to have been relatively few studies reported in the
literature on the effect of gases on the electrical resistivity of pure
metals other than palladium. Such work might prove to be very
significant in showing the nature of the interaction of gases with metals.

One might expect dissolved oxygen to lower the conductivity of pure
copper. It will be shown later in this section that oxygen will increase
the conductivity of slightly impure copper by removing the impurities
from solid solution. SMART, SMITH and PHILLIPS 147 have shown that
there is a very small decrease in the conductivity of pure copper due to
dissolved oxygen. Their results are shown in Table 4 on page 144.

It is seen that there does seem to be a loss in conductivity due to the 
dissolved oxygen, but the relationship between the data obtained at the 
two annealing temperatures is inconclusive. 

TABLE 4

Conductivity of Samples Annealed for One Hour at Indicated
Temperatures and Cooled by Rapid Quenching

                                           Conductivity per cent
Sample No.   Description                   Annealed at 500°C   Annealed at 850°C

A            Oxygen-free                   102.32‡
B            Oxygen-bearing                102.23†             102.10†
             (saturated at 850°C)
C            Oxygen-bearing                102.21‡             102.23†
             (Check run)
D*           Oxygen-free                   102.39†
E*           Oxygen-bearing                102.19†

* Measurements on these samples were made in another laboratory.
† Annealed in nitrogen.
‡ Annealed in hydrogen.

In the course of work on internal oxidation, MEIJERING and
DRUYVESTEYN 148 determined the effect on the electrical resistance of annealing
silver- and copper-base alloys in oxygen. Table 5 gives in
semi-quantitative terms the results of this work. Factors such as oxidation
temperature, and time of annealing before or after oxidation affect the
resistivity. These factors are not considered in the table so the results are
semi-quantitative.

TABLE 5

Resistivity in 10⁻⁶ Ω cm at 0°C of Annealed (ρa) and Oxidized (ρo) Silver and
Copper Alloys

Description of Alloy                     ρa      ρo      (ρo − ρa)/ρa × 100
                                                         (per cent)

Silver + 0.33 per cent magnesium         2.05    2.32    + 13
Silver + 0.15 per cent beryllium         1.56    1.92    + 23
Silver + 0.26 per cent aluminium         3.45    2.39    − 31
Silver + 0.60 per cent titanium          2.39    1.94    − 19
Silver + 0.79 per cent zinc              2.05    1.65    − 20
Pure silver*                             1.75    1.57    − 10
Copper + 0.75 per cent magnesium         2.30    1.72    − 25
Copper + 0.10 per cent beryllium         2.08    1.70    − 18
Copper + 0.69 per cent beryllium         2.95    1.76    − 40
Copper + 0.81 per cent aluminium         3.17    1.78    − 44
Copper + 1.1 per cent titanium           8.45    1.78    − 79
Pure copper                              1.61    1.62

The percentages are weight per cent. The silver-zinc alloy was annealed in nitrogen
to cut down evaporation of zinc.
* Evidently not very pure.
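The last column of Table 5 is simply the relative change of resistivity on oxidation, (ρo − ρa)/ρa × 100. The short sketch below recomputes it from the two resistivity columns as a check on the tabulated figures. The alloy labels are abbreviated here for convenience and the function name is illustrative, not part of the original work.

```python
# (alloy, rho_a, rho_o, tabulated change in per cent), from Table 5.
# Resistivities are in units of 10^-6 ohm cm at 0 deg C.
TABLE_5 = [
    ("Ag + 0.33 Mg", 2.05, 2.32, 13),
    ("Ag + 0.15 Be", 1.56, 1.92, 23),
    ("Ag + 0.26 Al", 3.45, 2.39, -31),
    ("Ag + 0.60 Ti", 2.39, 1.94, -19),
    ("Ag + 0.79 Zn", 2.05, 1.65, -20),
    ("Pure Ag", 1.75, 1.57, -10),
    ("Cu + 0.75 Mg", 2.30, 1.72, -25),
    ("Cu + 0.10 Be", 2.08, 1.70, -18),
    ("Cu + 0.69 Be", 2.95, 1.76, -40),
    ("Cu + 0.81 Al", 3.17, 1.78, -44),
    ("Cu + 1.1 Ti", 8.45, 1.78, -79),
]


def percent_change(rho_a, rho_o):
    """Relative change of resistivity on oxidation, in per cent."""
    return 100.0 * (rho_o - rho_a) / rho_a


for alloy, rho_a, rho_o, tabulated in TABLE_5:
    computed = percent_change(rho_a, rho_o)
    print(f"{alloy:14s} computed {computed:+6.1f}  tabulated {tabulated:+d}")
```

Each recomputed value rounds to the tabulated figure. For pure copper, which has no entry in the last column, the same formula gives a change of well under one per cent.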

In the case of the copper alloys, the resistance after oxidation of the
alloy approaches that of pure copper. Only the non-hardening zinc
alloy in the silver-base group decreased to the resistivity of pure silver.
The alloys with titanium and aluminium had a resistivity well above
that of pure silver and those of magnesium and beryllium increased
above the value for the annealed alloy.

The two main factors which influence the resistance have been
described by MEIJERING and DRUYVESTEYN as :

"(1) Internal strains cause generally an increase of the resistance as
the lattice is distorted irregularly by these strains. This effect causes a
decrease of the resistance by annealing. TAMMANN and DREYER 149*
found that by annealing after 98 per cent rolling the resistance of pure
silver decreases by 5 per cent, the decrease of pure rolled copper after
annealing being only 1.6 per cent.

"(2) A small number of foreign ions, substituted on the normal
lattice places, increases the resistance, this effect being for copper and
silver alloys the larger the higher the valency of the alloying element, 150*
as ions with a high electric charge will scatter the electron waves
considerably. When precipitation occurs and larger particles are formed the
resistance will decrease. It is uncertain whether the scattering of
electron waves in a silver lattice is highest on magnesium ions or on
magnesium oxide molecules."

Fig. 14. Effect of atmospheric exposure on electrical conductivity of a
Mg-Al-Zn alloy. Such exposure results in the addition of hydrogen to the
magnesium lattice (BUSK and BOBALEK 119).

Thus oxidation of the alloys may be expected to involve both these
factors to some degree. It was suggested that the lattice distortion on
oxidation would probably increase, causing an increased resistivity,
while the electron scattering near the solute ions probably decreases,
tending to cause a decreased resistivity. If, on oxidation, very large
particles of oxide are precipitated then the strain in the lattice will not
be sufficient to increase the resistivity. If the scattering of electrons
near the solute ions is large as in the case of trivalent or quadrivalent
ions like aluminium and titanium, then there will be a marked tendency
for the resistivity to decrease when the solute element is oxidized. In
silver alloys with magnesium and beryllium in solution, the increase in
resistance, due to increased distortion in the lattice after oxidation,
appears to be a more effective factor than the tendency to decrease the
resistivity by decreasing the electron scattering. In all other cases the
effect of the decrease of electron scattering outweighs the distortion
effect.

* This number refers to the list of references at the end of the present paper.

Also considering alloys, BUSK and BOBALEK 119 determined the effect
of hydrogen on the conductivity of magnesium alloys. The results of
their investigation are shown in Fig. 14. The last point on the diagram
was obtained after the specimen had been saturated with hydrogen by
charging it for 4 hours in an alkaline solution. Prior to saturation the
specimen was exposed to a normal atmosphere for the times shown, 
during which period hydrogen was absorbed by the lattice as indicated 
in another experiment. No suggestions were given to account for the 
curve. It was shown that the initial rise in conductivity was not due to 
the precipitation of a second phase during the initial stages of gassing. 

Miscellaneous Effects 

Other interesting effects caused by gases in various metals are discussed 
in the remainder of this section. Since each effect is a relatively isolated 
phenomenon, only a small amount of study having been devoted to 
each one, this section has been separated from the others. 

The analogy between the solid solution of gases in metals, and liquid 
solutions has been demonstrated by experiments showing that the 
electrolysis of these solid solutions is quite possible. COEHN and his 
co-workers studied the hydrogen-palladium system 151 and SEITH and 
KUBASCHEWSKI 152 reported a similar effect in the iron-carbon system.
In each case the non-metallic element moves in the solid metal as a 
positive ion under the influence of the electric field. This agrees with 
the idea that hydrogen is ionized to form an electron and a positive 
hydrogen ion when it enters the metal. More recently DE BOER and 
FAST 50 have found that under suitable conditions of temperature and 
applied potential difference, the oxygen in solution in zirconium wires 
could be made to diffuse through the metal lattice as a negative ion. 
In one experiment a wire with 5 atomic per cent of dissolved oxygen 
was heated to 1640°C by passing direct current through it. After some
hours the temperature had dropped at the negative pole and had risen 
to a white heat at the positive end of the wire. Subsequent x-ray 
analysis indicated a concentration of 16 atomic per cent of oxygen at
the positive end and a gradual transition to almost pure zirconium at
the negative end. In another experiment the oxygen was driven back 
and forth at will, simply by reversing the polarity of the applied poten- 
tial. It was predicted by BARKER 3 that oxygen would behave as a 
negative ion. 

In the same work 50 it was shown that the lattice constants a and c 
of the hexagonal zirconium lattice both increase up to 10 atomic per 
cent of oxygen. Beyond that point only the c value increases. Measure- 
ments were made on specimens containing up to 30 atomic per cent 
oxygen. This greater increase of c is similar to the case for titanium 
reported more recently by CLARK, 118 although CLARK measured only 
to 0·5 per cent of dissolved gas. The specific gravity of the zirconium-
oxygen samples was measured and it was found that this increased over
5 per cent while the volume increased less than 2 per cent after absorp- 
tion of 30 atomic per cent oxygen. A calculation of the specific gravity 
was made assuming that all the oxygen is contained in the interstices 
of the zirconium. The calculated values agreed very well with the 
experimental data. From all this information, and from previous work
by FAST, it was concluded that all oxygen atoms are contained in solid 
solution at interstitial positions. More recently, HALL, MARTIN and 
REES 153 have confirmed by x-ray analysis that in samples containing 
up to 36 atomic per cent oxygen, the oxygen was in true solid solution 
in the zirconium. 
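
The density argument above can be checked with simple arithmetic. The following is a minimal sketch, assuming standard atomic masses for zirconium and oxygen (these values and the function names are illustrative and are not taken from the original work) and using the quoted upper bound of 2 per cent for the volume increase. The substitutional case is included only as a hypothetical contrast.

```python
# Rough check of the interstitial-solution density argument for Zr-O.
# Assumed inputs (not from the text): standard atomic masses.
M_ZR = 91.22   # g/mol, zirconium
M_O = 16.00    # g/mol, oxygen


def interstitial_density_ratio(atomic_fraction_oxygen, volume_increase):
    """Density relative to pure Zr if every oxygen atom occupies an
    interstice, so that no zirconium atom is displaced from its site."""
    oxygen_per_zr = atomic_fraction_oxygen / (1.0 - atomic_fraction_oxygen)
    mass_ratio = (M_ZR + oxygen_per_zr * M_O) / M_ZR
    return mass_ratio / (1.0 + volume_increase)


def substitutional_density_ratio(atomic_fraction_oxygen, volume_increase):
    """Hypothetical contrast: oxygen replaces zirconium on lattice sites,
    with the same number of sites and the same fractional volume change."""
    x = atomic_fraction_oxygen
    mass_ratio = ((1.0 - x) * M_ZR + x * M_O) / M_ZR
    return mass_ratio / (1.0 + volume_increase)


if __name__ == "__main__":
    r = interstitial_density_ratio(0.30, 0.02)
    s = substitutional_density_ratio(0.30, 0.02)
    print(f"interstitial:   {100 * (r - 1):+.1f} per cent density change")
    print(f"substitutional: {100 * (s - 1):+.1f} per cent density change")
```

With 30 atomic per cent oxygen and a 2 per cent volume increase, the interstitial model gives a density rise of a little over 5 per cent, in line with the measurement quoted above, whereas replacing zirconium atoms by oxygen atoms would lower the density.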



JAEGER, ROSENBOHM and VEENSTRA 154 have reported that the 
specific heat of silver freshly melted in air, or heated in air at 940°C
for 5 to 10 hours was higher than that of silver which had been melted
in a vacuum, or had been stored for several months after melting in air.
Their results showed a marked change in specific heat when gas-free
silver was heated in air above 650°C. This effect was ascribed to the
presence of dissolved oxygen. 

Later, KELLEY 155 determined the specific heat of pure tantalum, and
of tantalum containing small amounts of hydrogen in solution over the 
temperature range 80K to 295K. The tantalum containing hydrogen 
behaved anomalously, the specific heat having a high maximum in the 
region 220 to 265K, depending on the amount of hydrogen in solution. 
The height of the maximum and the difference in measurable heat 
content between pure tantalum and that containing hydrogen were 
found to be about proportional to the amount of hydrogen in solution. 
The temperature at which the maximum occurs increases as the 
hydrogen content increases. When specimens containing hydrogen 
were degassed the specific heat returned to the normal value for pure 
tantalum. KELLEY has postulated that differences in entropy between 
hydrogenated and gas-free tantalum may be attributed to differences in
the number of possible ways of arranging the hydrogen ions in the
tantalum lattice at the higher and lower temperatures. 



The effects of gases on the mechanical properties of metals will not 
be considered in great detail here. It is felt that much of the experi- 
mental work has been done on metals having metallic impurities which 
would make the results of the work insignificant. In some cases 
unsuspected internal oxidation may have occurred which would seriously 
distort the results of the experiment. In all cases where a second phase
is formed (oxide or nitride) the hardening
or embrittling effect is well known and 
need not be discussed. The effect of 
hydrogen, oxygen or nitrogen in true 
solution in those metals which only dis- 
solve small amounts of the gas does not 
seem to be really well known. Of those 
metals which dissolve relatively large 
amounts of gas, titanium has been the 
subject of the most intensive study. 

JAFFEE and CAMPBELL 146 have studied 
the effect of oxygen, nitrogen and hydro- 
gen on the mechanical properties of 
iodide-refined titanium. Fig. 15 shows 
a plot of the average cross-sectional 
hardness of the specimens versus their 
gas contents. The results indicate that 
nitrogen may harden titanium to a 
slightly greater extent than oxygen, while 
hydrogen actually appears to decrease 
the hardness slightly. This apparent 
decrease was felt to be due to hardness 
variations in the original titanium rod. 
Fig. 16 shows that, for the range of 
solubility considered, nitrogen increases the tensile strength more 
than oxygen, and correspondingly decreases the ductility more. The 
apparent high ductility found with 0·5 per cent of hydrogen was felt
to be due to specimen variation and not to any effect of the gas. 

FINLAY and SNYDER 156 have recently studied the effect of nitrogen, 
oxygen and carbon on the mechanical properties of titanium. This 
work is similar to that discussed above but is more complete. In 
general this more recent study has confirmed the results obtained by 
JAFFEE and CAMPBELL. In contrast to the substitutional alloys, where 
the solute element with the lowest solubility has the greatest strengthening
effect per unit concentration, it was found here that the interstitial
element with the lowest solubility of the three studied (carbon) has the
lowest strengthening effect. However, it was also noted that the 
element (oxygen) with the highest solubility does not have the greatest 
hardening effect. Also JAFFEE and CAMPBELL showed that hydrogen, 
with a greater solubility in titanium than carbon, oxygen or nitrogen, 
has almost no strengthening effect. There does not appear to be suffi- 
cient information available to make a generalization at this time. 
FINLAY and SNYDER have studied the relationship between the amount 
of hardening and the extent of lattice-parameter increase effected by 
additions of the interstitial solute elements. This relationship is shown 
in Fig. 17. It was suggested that oxygen and nitrogen may behave
similarly in their interactions with the bonding electrons in the alpha 
solid solution, thus accounting for the correlation between mechanical 
properties and parameter changes. 



A recent paper by WAIN and COTTRELL 157 describes one of the most
interesting effects which gases have on metals. In this work it was 
shown that zinc crystals containing nitrogen have a sharp yield point 
when sufficiently strain-aged, while no yield point was found in nitrogen- 
free zinc. References are given in the paper to others which have shown 
that nitrogen also causes a yield point in iron, cadmium single crystals, 
and molybdenum polycrystals. OROWAN had observed a similar effect 
in zinc which he termed "thermal hardening," but he did not attribute 
the yield point to the presence of a specific solute element. Unlike 
cadmium crystals, the zinc crystals would not absorb sufficient nitrogen 
from the atmosphere during growth, so the nitrogen had to be intro- 
duced in a preliminary operation, by melting the zinc under a flux or by 
bubbling the gas through the zinc. It was definitely shown that the 
effect was due to nitrogen present internally and not merely as a surface 
film. Stress-strain curves, made after various ageing treatments at 
room temperature, show that the elastic limit decreases due to recovery 
and the yield point does not appear for several hours. Almost complete 
recovery precedes the return of the yield point, which appears after 
ageing for 8 or 9 hours at room temperature. Ageing for longer time 
periods causes an increase in the yield point. At higher temperatures 
the yield point appeared after shorter ageing times. From experimental 
results it was deduced that the process is thermally activated, with an 
activation energy of 21,000 cal/mole. The orientation of the crystals 
was found to have no effect on the magnitude of the yield point. The 
theory presented to explain this effect was that: ". . . the yield point
is caused by the segregation of solute atoms to dislocations. The 
attraction of the dislocations to the segregated atoms provides a bond 
which has to be broken by a larger force than is necessary to maintain
freed dislocations in motion; the material thus gives way suddenly
and softens, at the start of plastic flow, producing a sharp yield point. 
The theory also explains the observed removal of the yield point by 
plastic overstrain and its return on strain-ageing. A freshly strained 
specimen contains freed dislocations and does not show a yield point, 
but on ageing these dislocations become anchored by the migration of 
solute atoms to them and the yield point returns." 
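
The quoted activation energy allows a rough estimate of how quickly the yield point should return at other ageing temperatures. The sketch below assumes that the ageing time scales as exp(Q/RT), that "room temperature" means about 293 K, and that 8.5 hours is a fair midpoint of the "8 or 9 hours" reported; none of these assumptions, nor the function name, comes from the original paper.

```python
import math

R_CAL = 1.987        # gas constant, cal/(mol K)
Q_CAL = 21_000.0     # activation energy quoted in the text, cal/mol


def ageing_time(t_ref_hours, temp_ref_k, temp_k, q=Q_CAL):
    """Estimated time for the yield point to return at temp_k.

    Assumes the time is inversely proportional to an Arrhenius rate,
    so that t(T) = t_ref * exp[(Q/R) * (1/T - 1/T_ref)].
    """
    exponent = (q / R_CAL) * (1.0 / temp_k - 1.0 / temp_ref_k)
    return t_ref_hours * math.exp(exponent)


if __name__ == "__main__":
    # Assumed reference point: 8.5 h at 293 K.
    for temp_k in (293.0, 323.0, 373.0):
        hours = ageing_time(8.5, 293.0, temp_k)
        print(f"{temp_k:.0f} K: about {hours:.3g} h")
```

On these assumptions a rise of 30 K shortens the ageing time from several hours to well under one hour, which is qualitatively consistent with the observation that the yield point appears after shorter ageing times at higher temperatures.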

It is believed that, in each of the metals in which the yield pheno- 
menon has occurred, the nitrogen is present in interstitial solution so 
that it distorts the lattice of the metal and diffuses very rapidly, condi- 
tions which must be fulfilled to agree with the theory suggested. It was 
predicted that marked yield points may not be found in face-centred 
cubic metals "because of the difficulty of anchoring screw dislocations 
in this type of lattice." 

It was previously pointed out that one consequence of the theory of 
nitrogen atoms migrating to dislocations (in alpha-iron) is that the 
electrical resistance should decrease by a small, predictable amount on 
strain-ageing. This effect was examined by COTTRELL and CHURCH- 
MAN, 158 and was found to exist. 

SUBSURFACE OXIDATION OR INTERNAL OXIDATION 

Although, as mentioned earlier, surface oxidation will not be discussed, 
the phenomenon of internal oxidation is considered to be within the 
scope of the present review. 

Generally, if an alloy contains a solute element with greater affinity 
for oxygen than the solvent metal, the minor constituent may oxidize
below a surface oxide formation, in an area where the partial oxygen 
pressure is too low to cause oxidation of the major element. This 
subsurface oxide will not flake off as may the surface layer of oxide of 
the solvent metal. 

Although this effect was observed by TURNER 159 as far back as 1912, 
relatively little has been published until recently. Although they both 
observed internal oxidation in commercial copper during annealing, 
BLAZEY 160 and later WYMAN 161 did not consider this to be as important
as other factors causing embrittlement. C. S. SMITH 162 is responsible
for the term "subscale" which is used to denote the layer of subsurface
oxide, and he appears to have been the first to make quantitative
observations of the phenomenon. FROHLICH 163 has remarked on the
occurrence of internal oxidation of several copper alloys but did not
consider the problem in detail. LEROUX and RAUB 164 found a precipitation
structure of both cupric and cuprous oxide in silver-copper alloys.
The cupric oxide occurred in alloys poor in copper, and the cuprous
oxide in those rich in copper. Most of the experimental work in this
field has been done with copper as the solvent metal, but MEIJERING 165
describes the properties of alloys of silver, copper, nickel, and iron
respectively, with small amounts of other metals, after treatments 
leading to internal oxidation. 

In one paper, RHINES 60 has classified the modes of occurrence of
subsurface oxidation as one or a combination of three types: "(1)
general precipitation of separate particles of the oxide of the alloying 
element within a matrix of the major metal, (2) deposition of relatively 
thin layers of the oxide of the alloying element along the grain boun- 
daries or special planes of the major metal, or (3) envelopment of 
particles of the major component in a matrix of the oxide of the 
alloying element." The first type, which is generally indicated when 
one mentions "internal oxidation," occurs in dilute alloys at high 
temperatures. The second type occurs at lower temperatures with 
greater as well as smaller amounts of the alloying metal. The third 
type can only occur when there is a relatively large amount of the 
oxidized metal present. In the same paper RHINES has described some 
very interesting practical aspects of subsurface oxidation. 

The actual mechanism of internal oxidation seems to be quite simple. 
As prerequisites for internal oxidation to occur, RHINES 166 has indicated
that there must be a solubility of oxygen in the alloy and the oxygen 
must be able to diffuse fairly rapidly into the alloy. The minor con- 
stituent must form an oxide which is more stable than the oxide of the 
major component, and which is insoluble in the alloy. If one considers 
the system as a ternary alloy, then the constitution of the system must 
allow a quasibinary equilibrium to occur between the solid solvent and 
the oxide particles at the oxidation temperature. 

Fig. 18 shows typical internal oxidation. At the bottom of the 
picture one may see a relatively thin dark layer which probably consists 
mainly of cuprous oxide in this case, with an oxide of manganese. This 
formation is not actually described by RHINES, but would be analogous 
to one of a copper-silicon alloy described in an earlier paper. The 
second, and larger, region consists of copper-manganese solid solution 
plus a precipitate of a manganese oxide. It is this layer which is the 
area of internal oxidation. Finally, the region at the top of the photo- 
graph is simply a copper-manganese solid solution. By oxidizing at a 
low oxygen pressure a subscale may be formed without oxidizing any 
of the solvent metal, thus leaving an apparently clean surface even 
though a considerable depth of oxidation has occurred. This effect is 
shown in Fig. 19. 

To explain the mechanism of internal oxidation RHINES 166 has used
the diagram shown in Fig. 20. 

Fig. 18. Typical internal oxidation. Alloy of copper + 0·4 per cent
manganese oxidized at 1000°C in air for 8 hours. × 50 (RHINES 9).

Fig. 19. Internal oxidation in alloy of copper with 0·103 per cent
silicon. × 50. Sample oxidized at low oxygen pressure has no
external scale. Internal oxidation is marked by zone crossing lower
half of picture (RHINES 166).

Fig. 20. Schematic representation of distribution of dissolved oxygen
and alloying element A in subscale and unoxidized solid solution
(RHINES 166).

As has been noted earlier in this paper, there is a considerable
solubility of oxygen in copper. Thus it is possible for the diffusing
oxygen in the metallic lattice to come into contact with the alloying
element. If the oxide of the alloying element is more stable than
cuprous oxide, under the conditions of temperature and oxygen pressure
imposed, then the solute metal oxide will form. This preferential
oxidation of the alloying element may take place beneath the external
oxide scale or, if the external oxygen pressure is not sufficient to form a
scale, it may extend to the apparently clean surface (Fig. 19). Once the
solute metal oxide has formed it will not remain in solution very long
but will separate from the lattice as a precipitate. As the solute metal
is exhausted by this precipitation, the oxygen supplied to the surface
of the subscale (either from the external atmosphere or from the copper
oxide layer) diffuses farther into the alloy and thus produces a zone of
ever-increasing thickness. At the same time, the solute metal
component must diffuse in the opposite direction toward the advancing
front of scale to replace the decreasing concentration of solute atoms 
being precipitated as oxide particles. Thus RHINES 9 points out that
the rate of subscale growth depends, among other conditions, upon the
diffusion rates of both oxygen and the reacting solute element. To
confirm this theory, it has been found that in many alloy systems, the
square of the subscale depth is a linear function of the time of reaction
at constant temperature. It was shown in another paper 167 that, if
melting does not occur, internal oxidation of copper alloys occurs
between 750 and 1000°C as a relatively simple diffusion process
involving the movement of gas and metal atoms. This process was
described by mathematical expressions. The oxidation process
investigated at 600°C was found to be more complex. DARKEN 168 has also
discussed the mechanism of subscale formation. He has shown how the
rate of growth and composition of the zone may be predicted from a
knowledge of diffusion constants, phase relations, and boundary 
conditions. 
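
The parabolic relation described above (the square of the subscale depth varying linearly with time at constant temperature) can be illustrated with a short calculation. The sketch below is not the treatment used by RHINES or DARKEN; it simply fits a rate constant k in depth² = k·t by least squares through the origin. The depth readings are hypothetical values chosen for illustration, and the function names are likewise assumptions.

```python
def parabolic_constant(times_h, depths_mm):
    """Least-squares slope k, through the origin, of depth**2 against time.

    Implements the parabolic law depth**2 = k * t at constant temperature.
    """
    numerator = sum(t * d * d for t, d in zip(times_h, depths_mm))
    denominator = sum(t * t for t in times_h)
    return numerator / denominator


def depth_at(time_h, k):
    """Subscale depth predicted by the parabolic law after time_h hours."""
    return (k * time_h) ** 0.5


if __name__ == "__main__":
    # Hypothetical measurements (hours, millimetres), for illustration only.
    times = [1.0, 4.0, 9.0]
    depths = [0.10, 0.20, 0.30]
    k = parabolic_constant(times, depths)
    print(f"k = {k:.4f} mm^2 per hour")
    print(f"predicted depth after 16 h: {depth_at(16.0, k):.2f} mm")
```

Because depth varies as the square root of time, doubling the depth of the subscale requires four times the oxidation period, which is one way of stating that the rate of formation decreases with time.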

RHINES found that the particle size of the precipitate depends upon
the chemical nature and concentration of the solute element and the
temperature of oxidation. The more stable oxides tend to form fine 
precipitates, while the less stable ones precipitate as coarse particles. 
High temperature and high concentrations of the alloying element 
both promote large particle size. Usually the oxide particles are well 
distributed throughout the zone of oxidation. 

The precipitate distribution depends on the system and the tempera- 
ture of oxidation. In some cases the density of oxide particles is greater 
near the metal surface, in other cases the reverse is true, depending on 
the relative diffusion rates. As the temperature of oxidation is lowered 
the oxide particles tend more to occur at the grain boundaries rather 
than in the uniform distribution characteristic of high temperature 
oxidation. 

RHINES has also found the rate of formation of the zone of oxidation
to decrease with time, with an increased content of the solute element,
with lower temperatures and with decreased oxygen pressure. The 
time to form a given thickness of subscale increases almost directly 
with increase in the concentration of the solute metal. 

In ternary alloys, where the two solute elements form oxides of 
different solubilities, double zones of internal oxidation occur. The 
outer zone contains the oxides of both alloying elements, while the 
inner zone contains only the more stable oxide of the two. By altering
the alloy composition one may change the dimensions of the subscale. 
By changing the oxygen pressure selective oxidation of the solute 
element forming the most stable oxide may be promoted. 

In 1947 RHINES 169 summarized his work, and in the review mentioned
that the surface hardening of alloys due to the presence of uniformly 
distributed oxide precipitate does not seem to be possible. However, in 
1945, CHASTON 170 had already noted what he called "precipitation- 
hardening effects" and used associated properties to develop his theory 
of water vapour embrittlement of silver (see section on Embrittlement). 
CHASTON was able, by careful oxidation, to obtain a marked increase 
in the Vickers hardness of silver alloys with small amounts of aluminium 
or zinc in solution. It was noted that there is an optimum solute con- 
tent required to give the maximum hardening effect for a given oxida- 
tion treatment. Larger amounts of the solute in solution appear to 
hinder the diffusion of oxygen into the subsurface layers of metal. It 
was found that the depth of the oxide subscale was greater in alloys of 
low solute element content, and the depth could be increased by 
increasing the oxidation time. Commercially pure silver could be
hardened by heating in air followed by heating in an inert atmosphere
containing zinc vapour. The zinc vapour apparently diffused into the 
silver to combine with the dissolved oxygen, thus causing a hardening 
effect. Spectroscopically pure silver, when solid, could not be hardened 
in this way, since it will not, according to CHASTON, dissolve oxygen
(see section on Embrittlement). Similar hardening was obtained by 
heating silver in air containing zinc vapour. In this case the zone of 
internal oxidation formed by heating silver for 4 hours in air containing 
zinc vapour was found to extend 0·04 in. below the metal surface.
Shortly after RHINES' review was published, MEIJERING and
DRUYVESTEYN 148, 165, 171 reported finding substantial surface hardening
caused by internal oxidation in certain binary alloys. This hardening 
effect has recently been confirmed by SMITH and DEWHIRST. 172 

In their paper MEIJERING and DRUYVESTEYN point out that "dispersion
hardening" by blocking the glide planes with minute particles is
effectively carried out by internal oxidation. By heating strips of silver
alloys for 2 hours in air at 800°C they obtained the following results:

Silver + 0·3 weight per cent magnesium: 170 Vickers (kg/mm², load: 2½ kg)
Silver + 0·4 weight per cent aluminium: 160 Vickers
Silver + 0·2 weight per cent beryllium: 135 Vickers

Similar strips annealed in nitrogen or hydrogen had hardnesses of from 
30 to 50 Vickers. 

The success of a hardening operation depends on the difference in 
stability of the oxides of the two components of the metal alloy. 
Experimental results have shown that although the oxidation of a given 
percentage of a solute element will increase the hardness of silver, the 
same treatment may not harden copper to the same extent. Similarly 
an oxidation treatment which hardens a copper-base alloy may not 
harden a nickel-base alloy due to the greater stability of nickel oxide. 
The difference between hardening and non-hardening alloys has been
explained on a thermodynamical basis by MEIJERING and DRUYVESTEYN.

With respect to the precipitation of the oxide particles, on which 
depends the resultant hardening effect, MEIJERING and DRUYVESTEYN
have shown that at very high magnifications (1200 ×) no particles were
visible. The oxide will be first formed in a molecular state, the molecule 
being immobile in the alloy lattice. After some time period this 
molecule may dissociate (the rate decreasing with increase in stability 
of the oxide) and the atoms of the oxide will move to "form greater 
(and, therefore, more stable) complexes." The less stable oxides produce 
coarser particles which have practically no hardening effect. 

X-ray measurements showed that the lattice spacing of silver containing
0·36 per cent magnesium, and silver containing 0·26 per cent
aluminium increases considerably by oxidation to a value larger than
that of either pure silver or of the unoxidized alloy. The influence of
dissolved oxygen in the silver is quite small. This result was interpreted 
to indicate that a considerable part of the magnesium and aluminium 
oxides are dissolved in almost molecular dispersion in the silver lattice. 
Those alloys of silver with zinc and cadmium which did not harden had 
a spacing about the same as that of pure silver, which indicated the 
formation of larger oxide particles. Copper alloys which have been 
hardened retain a lattice spacing almost equal to that of pure copper. 

Electrical resistance measurements made by MEIJERING and DRUY- 
VESTEYN show that the resistivity of the oxidized copper alloys decreases 
to a value approaching that of pure copper. In the silver alloys, the 
hardened specimens had a resistivity higher than pure silver (in two 
cases, with magnesium and beryllium, the resistivity increased on 
oxidation) and only the non-hardenable zinc alloy decreased to the 
value of pure silver. This high resistivity of the hardened metals 
has been assigned to two causes : (a) the silver lattice may be distorted, 
and (b) the electric field near a foreign ion or atom may differ from that 
near a silver ion. On oxidation the distortion effect is increased by 
the oxide molecules on the lattice. In cases other than magnesium and 
beryllium cited above, the slight decrease in resistivity is caused by 
annealing which may in some cases be larger than the effect caused by 
the oxide particles. 

Generally it was found that serious embrittlement is caused in poly- 
crystalline specimens which have been hardened to about 140 Vickers 
or more. The elongation in tension is often reduced to 1 or 2 per cent. 
However, since the brittleness was of an intercrystalline nature, single 
crystals hardened by oxidation remained quite ductile (28 per cent and 
21 per cent elongation). 

In defence of his failure to recognize the hardening effect, RHINES 9
attributes the cause to the fact that he used a "micro-character scratch 
hardness tester" while other investigators used micro-indentation 
measurements. From this he concludes that there is no matrix harden- 
ing but an "aggregate hardening effect, which is less likely to be reported 
by a scratch test." This conclusion is quite at variance with the x-ray 
measurements made by MEIJERING and DRUYVESTEYN 148 which 
indicated a lattice expansion of the alloys which had been hardened. 

On the basis of unpublished work done by THOMAS, 173 RHINES 9
states that: "It has been supposed, largely upon the basis of studies
by FROHLICH, 163 * that internal oxidation is not to be expected in alloys
in which the minor element is noble with respect to the major element.
An exception to this has been reported recently by THOMAS, 173 * who
studied the oxidation behaviour of copper-palladium and copper-
platinum alloys. Subscales were found in alloys containing as little as
5 per cent of the noble metal. The oxide of the subscale was, in these
cases, composed of cuprous oxide containing the noble metal in solid
solution." From the brief description given by RHINES it does not seem
as if this phenomenon is one of internal oxidation in the usual sense of
the term. It has been usually accepted that the minor component of
a binary alloy must behave according to the requirements set out earlier
in this section, i.e. it must oxidize preferentially to the major constituent
etc. This is not the case here, where the major component is oxidized.
It has not been made clear by RHINES how the noble metals are in any
way responsible for this "internal oxidation." This phenomenon may
be some secondary effect due to the presence of the noble metals, but
they are not directly responsible for the "internal oxidation."

* This number refers to the list of references at the end of the present paper.

GAS EVOLUTION ON SOLIDIFICATION OF METALS 

Although this review is not concerned with gases in molten metals, it 
may be well to consider briefly the effect which gases, evolved during 
solidification of metals, have on the resultant solid metals. The 
evolution may be caused simply by the fact that molten metal at the 
freezing point will absorb much larger quantities of gas than the same 
solid metal at the melting point. On solidification of the metal, the 
excess of gas comes out of solution, forms bubbles, and is trapped as 
minute cavities or blowholes in the casting. Hydrogen generally 
appears to be the major offender in causing defects of this type, parti- 
cularly in iron, copper, nickel, and aluminium. Oxygen in silver causes 
a similar effect, known as "sprouting." 

If the metal contains dissolved oxygen as well as hydrogen, then on 
solidification of the metal, these gases may react under certain conditions 
to form steam. This "reaction unsoundness," as it is called, has been 
studied by ALLEN and his co-workers. In a very detailed study, 
ALLEN 174 described the porosity due to the simple evolution of hydrogen 
from copper, and went on to say that when oxygen was present as well 
as hydrogen the voids were smaller, more numerous, and occurred in 
the crystal boundaries. The amounts of oxygen and hydrogen required 
to cause this unsoundness were very small, of the order of 0·002 per cent
and 0·0001 per cent by weight respectively. ALLEN concluded also
that: "The phenomenon results from a reaction, taking place during the
process of freezing, between cuprous oxide and minute residual traces of 
hydrogen, whereby steam is generated. . . . Cuprous oxide and 
hydrogen can exist together in molten copper." Sulphur would have 
the same effect as hydrogen in reducing the oxide of copper. ALLEN 
suggested that the "reaction unsoundness" phenomenon is more 
important commercially than simple gas evolution. In a later paper 
ALLEN and STREET 42 described similar effects in copper-nickel alloys. 
It was concluded that unsoundness could be caused by an amount of
hydrogen below that needed for pure hydrogen unsoundness provided
that the oxygen content was greater than about 0·001 per cent for alloys
containing less than 25 per cent nickel, and about 0·01 per cent for
alloys of more than 30 per cent nickel. This critical oxygen content is
related to the solid solubility of oxygen in the alloys. ALLEN 175 also
studied the effect of pressure on the evolution of gases from metals, 
using the silver-oxygen system as an example. 
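
The very small quantities quoted above for reaction unsoundness in copper can be put in proportion by a stoichiometric estimate. The sketch below assumes that the steam-forming reaction consumes hydrogen and oxygen in the ratio required for H2O and uses standard atomic masses; these assumptions and the function names are illustrative and do not come from ALLEN's papers.

```python
# Assumed standard atomic masses (not from the text).
M_H = 1.008    # g/mol, hydrogen
M_O = 15.999   # g/mol, oxygen


def oxygen_for_hydrogen(hydrogen_wt_pct):
    """Weight per cent of oxygen consumed if all the hydrogen forms H2O."""
    return hydrogen_wt_pct * M_O / (2.0 * M_H)


def limiting_species(oxygen_wt_pct, hydrogen_wt_pct):
    """Return which element is exhausted first in forming steam."""
    needed = oxygen_for_hydrogen(hydrogen_wt_pct)
    return "hydrogen" if oxygen_wt_pct >= needed else "oxygen"


if __name__ == "__main__":
    hydrogen = 0.0001   # per cent by weight, from the text
    oxygen = 0.002      # per cent by weight, from the text
    needed = oxygen_for_hydrogen(hydrogen)
    print(f"oxygen needed: {needed:.5f} per cent by weight")
    print(f"limiting species: {limiting_species(oxygen, hydrogen)}")
```

On this estimate 0·0001 per cent of hydrogen requires rather less than 0·001 per cent of oxygen, so that at the quoted levels the oxygen is in excess and the hydrogen content limits the quantity of steam that can be generated.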

It was stated in the introduction to the present paper that there would 
be no discussion on degassing. However, for the reader who may be 
concerned with this problem, there is a section in the supplementary 
reference list which gives a few of the many papers which have been 
written on porosity, unsoundness and degassing. The very brief 
description of the problem which has been given here is not intended 
to be a review of the latest work as much as it is an introduction to a 
commercially important aspect of the gas-metal problem. To the 
person interested in physical metallurgy there may not be as much 
interest in this problem as to the foundryman. At any rate further 
details may be found in the papers referred to in the supplementary 
list. Since many of the papers discuss both porosity and degassing, no 
attempt has been made here to indicate which papers deal with any 
particular study. 

EMBRITTLEMENT OF METALS BY GASES

The phenomenon of embrittlement of metals by gases has long been a
problem in the metal industry and seems to re-occur as an unsolved
problem at various times. Here the mechanism of gas embrittlement will 
be discussed, and the causes mentioned. The remedies will not be 
considered here, but may be found in some of the papers which are 
referred to below. 

At the outset, the writer wishes to emphasize a point which was 
mentioned by MARTIN and PARKER. 176 The term "hydrogen embrittlement,"
which is correctly used in referring to iron and steel, for
example, becomes a misnomer when it refers to the common embrittle- 
ment of copper and silver. This latter phenomenon will be referred to 
here as "water vapour embrittlement" in recognition of the fact that 
both hydrogen and oxygen are necessary to cause embrittlement in 
solid copper. Water vapour embrittlement will be discussed first, and 
true hydrogen embrittlement will be considered later. 

In a series of papers, WYMAN, 177 in discussing copper embrittlement, 
has shown that alternately annealing in oxidizing and reducing 
atmospheres causes water vapour embrittlement. It was thought that 
during the oxidizing cycle, oxygen diffuses into the metal and forms 
cuprous oxide. It will be shown later that compound formation is not 
at all necessary for embrittlement. During the reducing anneal,
hydrogen combines with the oxygen to form water vapour which
ruptures the grain boundaries. 

WYMAN'S investigations led to some rather important conclusions 
concerning commercial annealing practice : 

"1. Neither pure copper when oxidized or deoxidized nor tough- 
pitch copper up to 0.06 per cent oxygen is embrittled by hydrogen at
temperatures of 400°C or below, at four hours.

"2. (a) The rate of oxygen penetration into 'pure' copper decreases 
markedly below 800°C.

"(b) The rate of hydrogen penetration into oxygen-bearing copper
decreases markedly below 700°C. Consequently, up to about 800°C, the
coppers containing no oxygen are superior to tough-pitch coppers for 
resistance to the embrittling action of oxidation-reduction cycles. . . . 

"5. The elimination of oxygen by purifying process or by means of 
deoxidation of the copper is advantageous in increasing the resistance 
to the embrittling action of reducing gases."

In the final paper of WYMAN'S series it is shown that a steam atmosphere
alone, even for periods of 8 hours at temperatures of about 1000°C,
is not sufficient to embrittle the copper. Any iron in the furnace will
reduce the water vapour and make hydrogen available for the embrittlement
of the copper, but this action is only important at temperatures
above 700°C. It was concluded that (with oxidized copper) hydrogen
will cause embrittlement even at 500°C if there is sufficient gas.
WYMAN feels that the sources of gas are dirt or grease on the copper, 
volatile products from enamel, dirt, grease, lubricants, or other materials 
carried into the furnace extraneously. 

RHINES and ANDERSON 178 have definitely established that water 
vapour embrittlement is caused in copper containing oxygen, either in 
solid solution, or as precipitated cuprous oxide, when hydrogen is 
present in the annealing atmosphere. The embrittlement was found to 
be more severe when cuprous oxide particles were present. RANSLEY 179 
had shown that pure dry carbon monoxide will not diffuse into copper, 
and removes oxygen from the copper without causing embrittlement. 
This was confirmed by RHINES and ANDERSON who found that copper, 
annealed in carbon monoxide, was not embrittled on subsequent 
heating in hydrogen. In this same paper it was shown that hydrogen 
could reduce foreign metal-oxides, present as impurities in the copper, 
thus causing embrittlement. The oxides which, when reduced, caused 
the most severe embrittlement were those of indium, manganese, 
nickel, phosphorus, tin, and zinc. The oxides of columbium, gallium,
lithium, magnesium, and zirconium were the least detrimental of those 
investigated, and the oxides of aluminium, arsenic, beryllium, bismuth, 
boron, calcium, cerium, cobalt, chromium, iron, lead, antimony, and 
titanium had an intermediate effect. Associated with RHINES' studies
of internal oxidation (q.v.), a minor type of embrittlement has been
found to occur when the oxides of the subscale were reduced by the 
hydrogen. Another type of embrittlement, not mentioned above, was 
also found when copper, containing small amounts of arsenic, antimony, 
or bismuth, was annealed in hydrogen. This effect was thought to be 
due to the formation of the volatile hydrides of these metals creating a 
pressure sufficient to rupture the alloy. 

Many separate reports are to be found in the literature on the 
embrittling action of water vapour which is usually produced within the 
metal on annealing in a hydrogen-bearing atmosphere. Most of the 
investigations have been purely empirical and in many cases the results 
are applicable only to the particular conditions under which the work 
was done. In many cases rather complex alloy systems have been used, 
so that there were several variables to be considered. This, of course, 
often leads to invalid assumptions and erroneous conclusions. The 
present writer would prefer to review just a few of the outstanding 
papers, including those discussed above. Although the following 
remarks apply particularly to copper and silver, it must be recognized 
that similar phenomena must be expected wherever one is treating a 
metal which will dissolve both oxygen and hydrogen in suitable 
quantities. 

In 1926, SMITH and HAYWARD 180 presented evidence which was
interpreted as showing that a sintering action occurs in copper when
heated above 800°C after having been embrittled by a hydrogen treatment.
It was observed that cast copper containing less than 0.06 per
cent oxygen could be deoxidized in a relatively short time by annealing
at a high temperature in hydrogen. However, with copper containing
larger amounts of oxygen (0.10 per cent), it was noted that, after
annealing at very high temperatures, the copper had much better
electrical and mechanical properties than similar copper annealed at
only 800°C. From metallographic and other evidence it was suggested
that at the lower annealing temperature, the hydrogen diffuses into the 
copper and reacts with oxide particles forming steam, which blows the 
metal apart. This initial rupture will allow more rapid penetration of 
the hydrogen, and the copper oxide eutectic areas rapidly become 
spongy networks of microscopic cracks. In the case of high-oxygen 
copper, it is claimed that the stress created by the steam formation 
strains the metal which, on heating, recrystallizes and during this 
process the cracks are healed. With low-oxygen copper, the oxide 
particles form in globules along the grain boundaries. It is suggested 
that the hydrogen can reduce this oxide with little effect on the copper 
lattice. Since no strain is present, the copper will not recrystallize or 
sinter, so the metal remains brittle. 

An excellent paper by RANSLEY 179 indicates that the mechanism of
embrittlement suggested above is unlikely. From data available at that
time, RANSLEY made the following conclusions : 

"(a) Compound gases do not diffuse through solid copper. Dissocia- 
tion of molecules adsorbed at the surface may, however, be followed by 
penetration of the constituent atoms.

"(b) The embrittlement of copper in reducing atmospheres may in all
instances be ascribed to penetration by hydrogen. When embrittlement 
occurs in gases such as carbon monoxide, hydrogen is present as an 
impurity, or is produced either by dissociation at the surface, or by a 
side reaction, e.g. CO + H2O ⇌ CO2 + H2.

Fig. 21. Removal of oxygen from copper at 850°C in hydrogen (time in hr).
Curves are shown for hydrogen pressures of 760 mm, 10.0 mm, 5.0 mm
and 2.0 mm.
Broken line indicates theoretical diffusion of oxygen

"(c) Reaction between gases in solution with the formation of a new 
phase cannot occur in the lattice. Reaction can only occur at an inter- 
face, i.e. in the present case at the copper surface, at an oxide inclusion, 
or possibly at a grain boundary." 

Since cracking does not necessarily originate at oxide particles, and 
since the effect always seems to be intergranular, it was suggested that 
the hydrogen and oxygen can exist in equilibrium in the lattice as 
separate atoms, but they diffuse to the grain boundaries and there 
react to form water vapour which ruptures the metal. 

On the basis of results published by PILLING, RANSLEY has suggested 
a mechanism which depends also on the diffusion rate of oxygen in the 
copper, a point neglected by SMITH and HAYWARD. At low hydrogen
pressures (less than 26 mm Hg), it was noted that the rate of deoxidation
of copper at 700°C in hydrogen depended on the rate of diffusion of
oxygen in the copper, and not on the hydrogen pressure. This meant 
that the deoxidation was a surface process. Further experiments were
performed by RANSLEY, who deoxidized copper at 850°C with hydrogen
at various pressures. The time-loss of oxygen curves obtained are shown
in Fig. 21. At 760 mm pressure the metal was rapidly deoxidized and
became very brittle. At lower pressures there was no significant change
in the specimens until the conditions marked A1, A2 and A3 were
reached. At these points the specimens suddenly developed blisters. 
This blistering was attributed to the formation of water vapour beneath 
the ductile, deoxidized skin of the metal. RANSLEY has pointed out 
that blistering only took place when the solid deoxidation curves 
crossed the broken curve, which indicates the normal rate of diffusion 
of oxygen to the surface at 850°C. These curves indicated to RANSLEY
that, at a given hydrogen pressure, as long as the supply of oxygen 
available at the surface was sufficient to react with the hydrogen, then 
there would be no embrittlement. As the deoxidized layer becomes 
progressively thicker, however, there is a progressive decrease in oxygen 
diffusion, and finally the hydrogen will be able to enter the metal and 
cause water vapour formation and consequent rupture of the metal. 
RANSLEY concluded that: "The removal of oxygen from copper in
hydrogen will thus be a function of the oxygen content of the copper, 
the hydrogen pressure, the temperature, and the time of heating." 
As was stated earlier, RANSLEY'S work has also definitely shown that 
copper may be completely deoxidized, without deleterious effects on 
the mechanical properties, by annealing in carbon monoxide free from 
hydrogen. Carbon monoxide does not diffuse into solid copper. 
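
The broken curve of Fig. 21, the loss of oxygen when the rate is governed
solely by diffusion of oxygen to the surface, can be illustrated by a short
calculation. The following is a minimal sketch only, not RANSLEY'S own
computation: it assumes a plane sheet of half-thickness l, a constant
diffusion coefficient D, and a surface held at zero oxygen concentration,
and uses the standard series solution for diffusion out of a sheet. The
function name and the dimensionless time D t / l² are introduced here for
illustration; no numerical value of D is taken from the source.

```python
import math


def fraction_remaining(dimensionless_time, terms=200):
    """Fraction of the initial solute still in a plane sheet.

    Assumptions (illustrative, not from the source): uniform initial
    concentration, both surfaces held at zero concentration, constant D.
    dimensionless_time is D*t/l**2, where l is the half-thickness.
    """
    if dimensionless_time < 0:
        raise ValueError("dimensionless_time must be non-negative")
    total = 0.0
    for n in range(terms):
        k = 2 * n + 1  # only odd harmonics contribute
        total += math.exp(-((k * math.pi / 2.0) ** 2) * dimensionless_time) / k ** 2
    return 8.0 / math.pi ** 2 * total


if __name__ == "__main__":
    for tau in (0.05, 0.2, 0.5, 1.0):
        print(f"D t / l^2 = {tau:4.2f}  fraction remaining = {fraction_remaining(tau):.3f}")
```

On this idealized model roughly 7 per cent of the original oxygen remains
when D t / l² = 1.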

MARTIN and PARKER 176 found that silver, containing dissolved 
oxygen, suffered a decline in strength and ductility when heated in 
hydrogen. This effect is analogous to that in copper noted by RHINES 
and ANDERSON, 178 who found that water vapour embrittlement could 
occur without the presence of cuprous oxide particles. MARTIN and 
PARKER also found that, due to the rapid absorption of oxygen by 
silver, it is possible to embrittle silver by first annealing it in hydrogen 
and then in air. It was also found that when the silver specimens were 
annealed in air, a fine-grained structure was present near the surface 
but larger grains formed in the body of the metal. This grain-growth 
inhibition was apparently due to some property of the surface material 
only. When a layer of the surface metal was machined off the specimen 
prior to annealing, it was found that the fine grain structure did not 
form. It was concluded that the original specimen was not homogeneous, 
but no further tests were made. As was expected, specimens of gold 
and platinum showed no embrittlement on annealing in hydrogen 
after heating in air. The embrittlement of silver by heating in hydrogen 
did not occur below 500°C.

CHASTON 170 studied the inhibition of grain growth mentioned above, 
and also noted by RHINES and GROBE. 181 In the experimental work on
commercially pure silver (99.98 to 99.99 per cent silver) CHASTON observed
that annealing in an oxygen-free atmosphere at temperatures
above 600°C caused rapid growth, while the same treatment in air
caused impeded grain growth in the areas to which oxygen had diffused.
By heating this silver in air at relatively low temperatures (500°C)
where grain growth is slow even in vacuo, enough oxygen may diffuse
into the metal to prevent any grain growth even at 800°C. On the
other hand, it was also observed that grain growth occurred to an equal
degree in spectroscopically pure silver when it was annealed at 600-800°C
in air as when annealed in hydrogen. It was found that an
addition of 0.01 per cent copper, iron or lead to this silver restored the
grain growth inhibition. CHASTON attributed the grain growth inhibition
to the particles of the metallic impurity oxides formed by internal
oxidation.

The results of the work of MARTIN and PARKER were not in agreement 
with SMITHELLS 182 who claimed that silver saturated with oxygen
could not be embrittled by hydrogen "because oxygen diffuses so 
rapidly out of silver that it has all escaped before the hydrogen can 
diffuse in." Actually embrittlement by oxygen diffusion into hydrogen-
saturated silver was found to be a minor effect, contrary to the opinion
of SMITHELLS, who felt that the oxygen would diffuse in much faster 
than the hydrogen would diffuse out. However, CHASTON confirmed 
the possibility of embrittling silver by heating first in oxygen and then 
in hydrogen, the reaction between the gases occurring at temperatures 
above about 300°C. By various experiments it was learned that the
depth and severity of embrittlement was controlled by the treatment 
in oxygen, and that the variations in the hydrogen treatment had no 
effect, provided the metal was above 400°C.

CHASTON was not able to confirm the observation by MARTIN and 
PARKER that oxygen-free silver, annealed first in hydrogen, could be 
embrittled by heating in oxygen. CHASTON believes that the effect 
noted by MARTIN and PARKER was due to small amounts of oxygen 
retained in the silver prior to the hydrogen anneal. CHASTON was not 
able to produce embrittlement in silver with less than about 0.01 per
cent of total impurities. The exact limiting amount of impurity which 
will restrain grain growth or promote embrittlement was not deter- 
mined. Another important observation was that these effects are not 
affected by the rate at which silver is cooled after annealing in air. 
CHASTON has claimed that this last observation is an argument against 
the possibility that dissolved oxygen may be an important factor in the 
embrittlement reaction. For this, and other reasons, a new mechanism 
of embrittlement was proposed involving the concept of a subscale 
formation of impurity metal oxides in the silver. CHASTON suggests 
that this finely dispersed phase would (a) hinder grain growth, (b) react
with hydrogen to embrittle the silver, and (c) harden the metal (see
section on Internal Oxidation). It was considered that such a formation 
would account for all the observations. One weakness in the idea, 
suggested by CHASTON himself, is that the oxygen may be removed 
from the silver by a vacuum treatment below the melting point. One 
would not expect this to occur if relatively stable oxides had been 
formed. However, it was claimed that: "It is not inconceivable, 
however, that in these conditions the very finely divided oxides distri- 
buted through silver may be capable of reduction to the metallic state." 

Another effect, not strictly embrittlement, has been studied by 
EBORALL and RANSLEY. 183 When an aluminium-magnesium alloy
containing 7 per cent of magnesium, is heated with water vapour in the 
temperature range between the solidus and the liquidus, the metal 
blisters and blackens. The blistering is due to the absorption of 
hydrogen which is released at surfaces in the interior of the metal 
when the concentration of hydrogen in the metal has reached a certain 
value. At lower temperatures (400°C) it was thought improbable that
the hydrogen alone would cause blistering. However, if severe flaws 
were present in the metal, they could be enlarged considerably by 
annealing in a moist atmosphere. 

ZAPFFE has considered the above type of problem 184 and has suggested 
the concept of the "hydrogen potential." A hypothetical system is 
considered to consist of a closed chamber divided into two compart- 
ments by a diaphragm of the metal being studied. Steam, at a constant 
partial pressure of 1 atm, is admitted to one compartment. If the metal 
membrane is initially saturated with oxygen, the steam will react 
with it to form metal oxide and hydrogen. Some of the atomic hydrogen 
will diffuse through the membrane forming molecular hydrogen in the
second compartment, some will remain in solution in the metal, and 
some will form molecular hydrogen in the steam. The limiting equilibrium 
hydrogen pressure which is equal in both compartments is called the 
"hydrogen potential," and is a measure of the hydrogen absorbed by 
the metal. The case is also considered where the membrane is not 
initially saturated with oxygen. It is shown from this concept, for 
example, that nickel has a very low "hydrogen potential" which is 
related to the fact that nickel does not react with steam. The "poten- 
tials" of aluminium and magnesium, which react with steam, are much 
higher. The measure of hydrogen potential is related to the tendency of 
the metal to react with water vapour and, therefore, to form blisters. 



Several years ago PFEIL, 185 when experimenting with samples of
almost pure iron containing crystals of about 1 in. in diameter, found
that when such a specimen had been pickled for 30 minutes in 20 per
cent sulphuric acid, the crystals could be broken away from each other
at the crystal boundaries. Apparently the metal was so brittle at the 
boundaries that fracture occurred without any marked deformation in 
the body of the individual crystals. Since that time, many experiments 
have been done to study the detrimental effects of hydrogen in iron and 
steel, particularly embrittlement and hair-line crack formation
caused by prolonged soaking in hydrogen atmospheres, pickling, or 
electrolytic plating. A survey of the extensive literature has shown 
that little is known concerning the fundamental factors involved in 
these problems. Many of the papers are purely descriptive and deal 
with relatively complex systems which are not suited to a study of the 
problem. The writer appreciates the importance to industry of knowing 
how to overcome these effects in certain special cases ; however, one 
feels that it is more important to answer the fundamental, "Why?" 
Fortunately, in more recent years, there have been a few studies 
devoted to answering the question. 

Some of ANDREW'S 186 earlier papers dealt with the effect on mechanical 
properties and hair-line crack formation of hydrogen introduced into 
laboratory specimens of steels. Later, SYKES, BURTON and GEGG 187 
attempted to interpret this work in terms of industrial practice. The 
hydrogen contents of a number of steels were determined at different 
stages of the steel-making process. It was confirmed that a hydrogen 
content greater than 2 c.c. per 100 g would reduce the steel's ductility. 
This is significant since it was also found that all the steels produced 
contained 4-6 c.c. of hydrogen per 100 g. Depending on the thermal 
history of the specimen, high hydrogen contents may or may not cause 
hair-line cracks. Although other factors may modify the phenomenon, 
it was definitely concluded that steel with a high hydrogen content is 
more liable to crack during heating and cooling than hydrogen-free 
steel. It was not suggested that hydrogen alone is the cause of crack 
formation. In fact, in slowly cooled material, free from macro-stresses, 
the hair-line cracks were attributed to the combined effects of hydrogen, 
local stresses, and low ductility in a preferred direction. The reader is 
referred to the paper for complete details of this interesting work, and 
for the puzzling problems which had to be left unanswered. 
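
Hydrogen contents quoted in c.c. per 100 g may be converted to parts per
million by weight. The sketch below assumes that the volumes refer to
molecular hydrogen measured at 0°C and 1 atm (the source does not state
the conditions), with a molar volume of 22,414 c.c. and a molar mass of
2.016 g; the function name is hypothetical.

```python
MOLAR_VOLUME_STP_CC = 22414.0  # c.c. per mole of ideal gas at 0 deg C, 1 atm (assumed conditions)
H2_MOLAR_MASS_G = 2.016        # grams per mole of molecular hydrogen


def cc_per_100g_to_ppm_by_weight(cc_per_100g):
    """Convert a hydrogen content in c.c. (STP) per 100 g of metal to ppm by weight."""
    grams_hydrogen = cc_per_100g / MOLAR_VOLUME_STP_CC * H2_MOLAR_MASS_G
    return grams_hydrogen / 100.0 * 1.0e6


if __name__ == "__main__":
    for content in (2.0, 4.0, 6.0):
        ppm = cc_per_100g_to_ppm_by_weight(content)
        print(f"{content:.0f} c.c. per 100 g = {ppm:.2f} ppm by weight")
```

On this assumption 2 c.c. per 100 g corresponds to about 1.8 parts per
million by weight, and the 4-6 c.c. per 100 g found in the steels to about
3.6-5.4 parts per million.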

Recently ANDREW 188 has published the results of a comprehensive 
study of the relationship between hydrogen and transformation 
characteristics in steel. Crack formation and embrittlement were found 
to be closely associated with each other, and with the general theme 
of the study, but the amount of hydrogen required for embrittlement 
is less than that needed for hair-line crack formation. In each case the 
crack formation is due to "disruptive hydrogen pressure being built up 
in internal cavities or voids." This pressure would be caused by the 
diffusion of atomic hydrogen into any imperfection in the lattice which
would hold more than one hydrogen molecule. It was also suggested
that the reaction between hydrogen and iron carbide might create a 
pressure of methane sufficient to rupture the lattice. Finally, ANDREW 
emphasized that hydrogen diffusivity and solubility are the two main 
factors determining the behaviour of a given steel with respect to hair-
line cracks. Unless the effect on these factors of structure and stresses
in the steel is known, full understanding of the phenomenon cannot be 
reached. The results of some work on diffusion have now been pub- 
lished by CHANG and BENNETT, 189 who investigated the effects of 
chromium, nickel and molybdenum on the rate of diffusion of hydrogen 
in iron. It is to be hoped that this is the first of a series of experimental 
studies carefully planned to reveal the fundamental reason for the 
phenomena. 

In another paper, ANDREW and LEE 190 discuss the hydrogen pheno- 
mena in steel on the basis of internal pressures caused by the hydrogen. 
It is suggested that at high temperatures the hydrogen is retained in 
the lattice in atomic form in solid solution; at lower temperatures,
however, the hydrogen may be occluded in molecular form at the crystalline
boundaries. If the hydrogen is precipitated from solution at high 
temperatures, plastic yielding will relieve the stresses. At lower 
temperatures the precipitating hydrogen creates internal stresses in the 
metal. Once the pockets of molecular hydrogen have been formed, 
further diffusion is impossible. It is claimed that the concept of 
molecular hydrogen concentration at the lattice imperfections (crystal- 
lite boundaries) must be feasible since it could explain why the composi- 
tion and thermal history of the steel control the amount of hydrogen 
present in it. It was shown how the amount of hydrogen retained by a 
steel specimen increased with the increasing hardness of the specimen 
produced by rolling. This cold work would increase the volume of 
crystallite boundaries, and, therefore, the available volume for mole- 
cular hydrogen precipitation. Just as a plea was made for more funda- 
mental research on diffusion, so a need was expressed for a clearer 
understanding of the effects of composition and treatment on the 
crystalline substructures. 

HOBSON and SYKES 191 have confirmed a correlation between the
ductility and the hydrogen content of low-alloy steels. Hydrogen was 
found to have little effect on the shape of the stress-strain curve, but 
fracture was caused before elongation and reduction in area had 
reached normal values. As in the other papers mentioned, it is stressed 
that the hydrogen content alone does not determine whether or not 
hair-line cracks will form, but it is a major factor. 

The author wishes to thank DR. J. N. PRATT, who read the first draft 
of this chapter and made many helpful suggestions. He also wishes to
Universities and The International Nickel Company of Canada, Limited,
who provided a fellowship for a research programme of which this

4
THE THEORY OF SINTERING 

G. A. Geach 

POWDERS of most metals will sinter when heated to about three- 
quarters of their absolute melting points while protected from oxidation 
or other gaseous attack. Permanent adherence occurs between the 
particles, the whole mass becomes less porous and its other properties 
change towards those of the dense metal. A great many studies of these 
changes have been published, but in almost every case the cold metal 
powder was mechanically compressed before sintering. This, of course, 
is the usual procedure when sintering is applied in industrial powder 
metallurgy. Each such study has taken into consideration only some of 
the factors which can affect the progress of sintering. Published work is 
reviewed by KIEFFER and HOTOP 46 and, comprehensively, by GOETZEL. 28
Pressing increases the initial areas of contact between particles and 
reduces the porosity before sintering ; it is usual to reach a far lower 
porosity when sintering a compressed powder than if a loose powder is 
used. Only very few studies of the sintering of unpressed metal powders 
have been published (SCHLECHT, SCHUBARDT and DUFTSCHMID 69;
DELISLE 18; SHALER and WULFF 73) but from these and from other
related work there is no suggestion that the basic course of sintering is 
modified by compression. 

Most metals are known to sinter, although those of quite low melting 
points, and a few metals of normal melting points but complicated 
structure and, perhaps, largely covalent binding, may be exceptions. 
Thus, germanium does not sinter even at 850°C (90 per cent of its
melting point). HUTTIG and HENNIG 41 have suggested it to be a general
rule that all metals sinter appreciably at about three-quarters of their
absolute melting points if interference by extraneous factors is avoided.
This tends to give too high sintering temperatures for metals of normal or
high melting points; if it applies to metals of low melting points, cadmium
and lead should sinter at 200°C and tin at about 100°C. No
sintering at all has been observed in tin. SAUERWALD 68 could find no
adherence between zinc particles at 360°C, although WOLF 80 observed
coherence when zinc was extruded through a die at 300°C. KIKUCHI 48
compared the hardness of metal powder pressings after heating them to 
different temperatures and, as would be expected, found the hardnesses 
of the worked metals to fall after increasing the temperature. In the 
case of zinc, however, the hardness rose slightly again after a minimum 
at 400°C. Silver also had a minimum hardness at 700°C, which is a
conventional sintering temperature for this metal. It is probable that
with both metals some bonding of the actual particles commenced at 
the temperatures of minimum hardness. 
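
The three-quarters rule is easily applied numerically. The sketch below
is an illustration only: the melting points are rounded handbook values
assumed here (they are not quoted in the text), and the function names are
hypothetical. Temperatures are converted to the absolute scale before the
fraction is taken.

```python
KELVIN_OFFSET = 273.15

# Rounded melting points in deg C (assumed handbook values, not from the text).
MELTING_POINTS_C = {
    "cadmium": 321.0,
    "lead": 327.0,
    "tin": 232.0,
    "zinc": 419.5,
}


def rule_temperature_c(melting_point_c, fraction=0.75):
    """Temperature (deg C) at the given fraction of the absolute melting point."""
    return fraction * (melting_point_c + KELVIN_OFFSET) - KELVIN_OFFSET


def homologous_fraction(temperature_c, melting_point_c):
    """A temperature expressed as a fraction of the absolute melting point."""
    return (temperature_c + KELVIN_OFFSET) / (melting_point_c + KELVIN_OFFSET)


if __name__ == "__main__":
    for metal in ("cadmium", "lead", "tin"):
        t = rule_temperature_c(MELTING_POINTS_C[metal])
        print(f"{metal:8s} three-quarters rule: {t:6.1f} deg C")
    frac = homologous_fraction(360.0, MELTING_POINTS_C["zinc"])
    print(f"zinc at 360 deg C is {frac:.2f} of its absolute melting point")
```

With these values the rule gives about 172°C for cadmium, 177°C for lead
and 106°C for tin, of the same order as the rounded figures quoted above,
while 360°C for zinc is about 0.91 of its absolute melting point.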

The extent to which oxide surface films interfere with the sintering 
of the low melting-point metals is not clear; available data indicate
that interference of this origin is at most only partial. Lead, for example,
has oxides which are reduced by carbon monoxide at very low
temperatures, but lead powder does not sinter perceptibly when heated
to within 5°C of its melting point for 7 hours in atmospheres of hydrogen
or carbon monoxide, or in vacuum. Comparison of the behaviour on 
heating of zinc powder prepared by filing sticks of zinc under carbon 
tetrachloride, and never exposed to air, with that of similar filings which 
were left in the atmosphere for an hour before heating, showed a dis- 
tinctly greater coherence in the unoxidized material after 2 hours at 
380°C (95 per cent of its melting point) but the extent of this sintering
was not comparable with that occurring in copper and similar metals 
at 50 per cent of their melting points. 

Many inorganic substances such as silica, refractory oxides and 
silicates, and halides of high melting points also sinter. So, also, do 
powdered glasses. Sulphur, silicon, carbon and pure organic materials 
which have low melting points and contain rather large permanent 
molecules or structural units seem not to sinter. Sintering appears to be 
a characteristic of substances which, at the sintering temperatures, 
contain such small structural units as atoms or simple ions. 

Sintering may be considered to consist of two stages : one is the 
welding of particles which are in contact and the other the elimination 
of porosity. The welding is not remarkable and it is well known that 
absolutely clean surfaces of many materials adhere when brought 
together. The shrinkage, however, implies the rapid movement of 
matter into the pores and the principal requirement of any theory of 
sintering is that it explains this flow. 

At one time it was suggested that attractive forces between particles 
of a powder developed at some characteristic sintering temperature 
(SAUERWALD 66 ) but there is no evidence that such forces do depend 
upon temperature. Heating is required to effect the removal of obstruc- 
tions between particles and to make it possible for the area of interface 
to increase and the mass to grow less porous. The union of pieces of 
cold metals occurs in pressure welding which takes place when clean 
surfaces of such metals as aluminium, cadmium or copper are squeezed 
together. A true weld is obtained across the interface. The only prepara- 
tion required is cleaning the surfaces of oxide and grease; pressing 
serves to bring clean metal surfaces together over the welding area. 
It is inconsequent to the formation of a weld whether recrystallization 
occurs across the interface.
The earlier observation by BEILBY S that filings of zinc will adhere in
the cold to steel and those of silver to a silver surface may also be a 
demonstration that the attraction across an interface does not require 
an elevated temperature. 

Fig. 1. Changes in length of copper compacts pressed at 1.4 × 10⁶ g/cm²
and then sintered isothermally in hydrogen 41

The cohesive force between small particles of quartz and of sodium
pyroborate was measured by BRADLEY, 11 who found it to be about
10 dynes between quartz spheres 0.05 cm in radius. This would be
about the order of a tensile strength if the contact was 10⁻⁴ cm across.
BRADLEY found his results very sensitive to the atmosphere about his 
specimens and so, presumably, to the surface films upon them. The 
possibility that a direct attractive force between atoms in two particles 
might act over distances of much more than atomic dimensions has been 
examined by SHALER. 71 He began with the Sommerfeld model of a
metal as positive ions in an electron cloud and, for copper, deduced
a force of about 2 × 10⁶ g/cm² acting across 120 Å. The effect of
temperature would be small. Action across such long distances was
questioned by OROWAN, 71 (discussion) who pointed out that by such forces
a 1 cm diameter copper sphere would support another hanging from it 
after they had touched. 
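
The comparison of BRADLEY'S force with a tensile strength can be checked
by simple arithmetic. The sketch below assumes a circular contact 10⁻⁴ cm
in diameter (the text says only that the contact is that distance
"across") and converts the resulting stress to g/cm² and to MPa; the
function name is hypothetical.

```python
import math

DYNES_PER_GRAM_FORCE = 980.665  # standard gravity in cm/s^2
PASCALS_PER_DYN_PER_CM2 = 0.1   # 1 dyn/cm^2 = 0.1 Pa


def contact_stress_dyn_per_cm2(force_dyn, contact_diameter_cm):
    """Mean stress over a circular contact (circular shape is an assumption)."""
    area_cm2 = math.pi * (contact_diameter_cm / 2.0) ** 2
    return force_dyn / area_cm2


# BRADLEY'S figures from the text: 10 dynes, contact 1e-4 cm across.
stress = contact_stress_dyn_per_cm2(10.0, 1.0e-4)
stress_gf_per_cm2 = stress / DYNES_PER_GRAM_FORCE
stress_mpa = stress * PASCALS_PER_DYN_PER_CM2 / 1.0e6

if __name__ == "__main__":
    print(f"stress = {stress:.3e} dyn/cm^2")
    print(f"       = {stress_gf_per_cm2:.3e} g/cm^2")
    print(f"       = {stress_mpa:.0f} MPa")
```

The result, about 1.3 × 10⁶ g/cm² (roughly 130 MPa), is of the same order
as the force per unit area deduced by SHALER.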

The properties in which changes during sintering are most simply,
and have been most often, followed are the porosity of the specimen and
those such as density or length which are related to it. At any given 
sintering temperature change is at first rapid and becomes gradually 
slower. 

Fig. 2. Densities of different specimens after sintering for
fixed periods at different temperatures 30, 62 (abscissa: sintering
temperatures as a fraction of the absolute melting points)

It is not clear whether, at constant sintering temperature, the porosity
of a material tends always to zero as sintering is prolonged or whether it
comes to a limiting value. SMITHELLS 75 states that tungsten reaches a
fixed density on sintering for about 15 minutes. It appears as though
crystalline inorganic materials, such as refractory oxides, come to constant
porosities, but that glasses have no such limits and tend continuously
to zero porosity. The most sensitive measurements on metals
show them to become continuously less porous during long experiments;
isothermal sintering curves for copper obtained by sensitive dilatometry
show this for times up to 60 hours (Fig. 1).

Whether porosity actually tends to a limit or in fact to zero, it does 
come to a roughly steady value in a few hours at practical sintering 
temperatures. This value depends very markedly upon that tempera- 
ture. Much of the published experimental work has reported such 
apparently limiting densities for different temperatures. Some typical 
examples of density, as a fraction of that of the compact metal, are 
plotted against sintering temperature, as a fraction of the absolute 
melting points, in Fig. 2. 

Results for iron are interesting as these show a very marked increase
in porosity at the α-γ transformation temperature. SCHLECHT, SCHUBARDT
and DUFTSCHMID 69 sintered unpressed carbonyl iron powder and
made metallographic sections after different sintering treatments, as
well as measuring the practical limiting densities reached. Increase
in pore size after passing the α-γ transformation temperature is clear in
their micrographs; there was little rounding of the pores below 1200°C.

DUWEZ 22 has also examined iron and found that the rate of densification
at 982°C was only one-thousandth of the rate at 816°C.
Shrinkage at 982°C could not, in fact, be detected after two months.

SHALER 71 made density measurements on specimens of sintered
unpressed copper. In 6 hours at 930°C in vacuum the density rose from
50 per cent to 95 per cent of that of copper, and the remaining pores were
sealed off from the surface. However, in similar experiments with
atomized copper powder GEACH and JONES observed quite different
rates of sintering, densities after 60 hours sintering at 900°C being
75 per cent of that of copper and at 1000°C, 80 per cent. In their
specimens no pores were closed off from the surface. Different results
from such similar experiments may be largely due to differences in the
powders used; in this instance SHALER used copper powder of a less
uniform size.

An initial expansion before the shrinkage normal to sintering is often
noted. This is usually quite small and most commonly reported during
slow sintering at low temperature. Thus it occurs in tungsten, which
may increase in length by about 0.2 per cent at 600°C, and DRAPEAU 19
obtained more considerable expansions of this sort (2 per cent in length)
in pressed copper specimens heated for 2 to 4 hours at 760°C. The
expansion is commonly said, but has not been shown, to be due to gases
contained in the metal. That it does not depend, as BALSHIN 7 suggested,
on cold work and subsequent recrystallization was shown by SHALER
and WULFF, 73 who sintered unpressed copper to a density of ?! g/cm³
by heating for 16 hours at 850°C in argon, and on then heating the same
specimen in vacuum for a further 8 hours found the density to fall to
5.3 g/cm³. In this experiment any possible cold work in the material
was annealed out during the first heating.

CLARK and WHITE 14 found that pressings of chromic oxide also
expanded during the first part of sintering at 1550°C; the density fell
from 2.16 g/cm³ to 2.10 g/cm³ in 16 hours, after which it rose again.
However, throughout this expansion the strength of the pressing
increased, so that bonding together of particles appeared to go on
during the expansion.

Measurements of mechanical properties of sintered powders have
practical value, but generally little more; these properties are very
sensitive to many factors in addition to sintering. HÜTTIG 39 has
examined such properties of sintered metals as the rate of solution in an
acid or the amount of methyl alcohol vapour adsorbed; these properties
are all connected with the volume of pores open to the surface of the
specimen. KIEFFER and HOTOP 47 have collected and compared many
properties of sintered and melted metals.

It is noteworthy that the temperatures at which the changes in
different properties of a material are most marked may vary quite
widely. This is very clear on comparing electrical resistivity, which is
sensitive to the early growth of interparticle welds, with density. Thus,
HAUSNER and PINTO 35 examined the sintering of beryllium in vacuum
and found that, while the resistivity fell abruptly at about 700°C
(170 to 50 microhm-cm), the sharp increase in density took place
above 1000°C. This difference in temperatures is quite understandable
because it will be the early stages of sintering, when welds between
particles develop, that will markedly influence resistivity, but the
structural changes then occurring in the general mass will not be great.
It is at higher temperatures that pore size is altered rapidly and that
great changes occur in density. MYERS 59 studied resistivity changes in
tantalum pressed at 7 × 10⁶ g/cm² (50 tsi) and sintered for 30 minutes
in vacuum. He obtained a marked drop in resistivity at 600°C and
smaller ones at 1200 and 2200°C. Density did not change appreciably
until this last temperature was reached. MYERS suggested that the
earlier stages are due to the removal of gases and impurities, but
HAUSNER and DEDRICK 34 supposed the second step to be due to evaporation
of tantalum within pores and its condensation at necks. Resistivity is
very dependent upon particle size; thus for copper pressed at
1.4 × 10⁶ g/cm² (10 tsi) and sintered 14 hours in hydrogen, HAUSNER 33
gives the following resistivities:

Resistivities in Microhm-cm

                          Sintering Temperature
Particle Size             600°C     800°C     1000°C
2 × 10⁻⁴ cm               5.01      2.42      1.74
44 × 10⁻⁴ cm              5.37      3.64      1.97
44-74 × 10⁻⁴ cm           5.6       3.78      2.52


Resistivity values often give a smooth curve when plotted against 
sintered densities. 

Several workers have made metallographic examination of the welds
between particles and in general found that no changes are to be
detected during early stages of sintering when the compact as a whole
gains strength and density. This is in agreement with the fact that
crystal growth across the interface is not essential in pressure welding.
LIKHTMAN, 56 however, reports the observation of minute bridges formed
between particles of electrolytic copper at temperatures as low as
300°C.

BIER and O'KEEFE 9 found that in specimens of copper reduced from
the oxide and pressed at 30 tsi the original particles only ceased to be
identifiable on polished sections after sintering at quite high
temperatures, for instance after 30 minutes at 900°C. COOK and PUGH 15
examined the changes in the structure of three types of copper powder
on sintering. They found that the larger original particles retained their
identities after sintering at temperatures between 400 and 800°C, but
at 800 to 1000°C the whole specimen acquired a fine-grained equiaxed
and porous structure. Recrystallization generally occurred at much
lower temperatures in the largest particles.

In electrolytic iron, studied by LIBSCH, VOLTERRA and WULFF, 55
after pressing at 4.6 × 10⁶ g/cm² (33 tsi) and sintering for 4 hours in
hydrogen, little change could be seen at 500°C (density 6.4 g/cm³) but at
625°C (density 6.8 g/cm³) recrystallization in grains and boundaries
appeared. At 725°C (density 7.0 g/cm³) a new structure of uniform
equiaxed grains smaller than the original particles had developed, and
from 825°C (density 7.2 g/cm³) to 900°C (density 6.9 g/cm³) very marked
grain growth occurred. Increased porosity near the α-γ transformation
temperature was thought not to be due to the transformation itself, but
perhaps to liberation of gases in the metal; the increase appeared to be
spread over a 100°C range about the transformation point. Powders
annealed before sintering changed less than similar material that had
been strained but not annealed; thus an electrolytic iron powder with
particles in a normal strained state increased in density from 3.0 to
3.7 g/cm³ on heating for 24 hours at 700°C, but if the powder used was
first annealed no change in density took place under these conditions.

Pores in the iron specimens appeared larger and were more rounded
after sintering at 900°C than at lower temperatures. The fact that pores
are usually more apparent as sintering proceeds probably arises because
flattened porous areas around grains change to more noticeable round
shapes. In a pressed powder the pores will tend to be flattened and
to extend thinly normal to the pressing direction. As such pores become
rounded, shrinkage should be more marked normal to than along the
pressing direction; this has been observed, for instance, by KAMM,
STEINBERG and WULFF 45 for copper. STEINITZ 76 associated changes in
the magnetic permeability of pressed iron powders when sintered with
the same change in pore shapes. Specimens pressed at 1.4 × 10⁶ to
14 × 10⁶ g/cm² (10 to 50 tsi) had permeabilities after sintering for
1 hour at 1150°C which agreed with theory for a mass having the
porosity distributed as flat discs. Specimens sintered at 1250 to 1350°C
for 24 hours had higher permeabilities than those of the same densities
obtained at lower sintering temperatures, but on re-pressing the
permeability fell back to the 'flat-pore' value.

Small pores disappear before larger ones; for example RHINES,
BIRCHENALL and HUGHES 63 found that the average pore size in copper
powders increases on sintering. The numbers of pores of cross sections
of 9 × 10⁻⁸ and 12 × 10⁻⁶ cm² in 0.0293 cm² of surface after sintering
in hydrogen are given in the table to illustrate these points, and Fig. 3
shows the general effect. Results are rather more erratic but the general
trends are the same when sintering in argon or in vacuum.

Fig. 3. The population distribution of pores in copper compacts
after sintering in hydrogen at 1000°C (abscissa: pore diameter)

Number of Pores in 0.0293 cm² of Section

                    Sintering            Sintering Time (Hours)
Pore Size           Temperature (°C)     1       10      100     1000
10⁻⁸ cm² pores      800                  1496    1370    1423    1339
                    900                  1654    1423    850     757
                    1000                 1415    1024    392     66
10⁻⁶ cm² pores      800                  0       1       5       5
                    900                  16      27      69      65
                    1000                 65      73      93      113



These workers also reported that regions near the outer surfaces of a
specimen appeared to become free from pores before the centre did.

GREENOUGH 29 has made a very interesting observation on the
behaviour of small "blow holes" (about 0.001 cm across) in silver. The
specimen was cold rolled sheet in which the holes were randomly
distributed. After it had been heated for 19 hours in nitrogen at 900°C
almost all the blow holes were collected on the new grain boundaries of
the metal. GREENOUGH also noted that the average size of these holes
had increased. He suggested that the holes move with boundaries,
part of the area of which they eliminate, but it is not at all clear how in
this phenomenon material crosses a moving hole. In the same specimens
large grooves along boundaries at the surface of the silver and a
depression of some grains below the level of others were noted. This implies
that holes move along boundaries, intact or as vacant sites; apparently
one grain dominates the supply of atoms to move a hole. GREENOUGH
questioned the validity of any assumption of random distribution of
pores on sintering; he believed that, if sintering involved a plastic flow
of matter, a body with pores distributed on planes in this way should
increase in density more rapidly than one in which they were randomly
distributed.

Experiments in which copper wires from 0.005 to 0.050 cm diameter
were wound in close-packed arrangement on bobbins, heated in hydrogen
and then sectioned on a diameter for metallographic examination have
been made by GEACH and JONES 27 and by ALEXANDER and BALUFFI 2;
these gave pictures resembling an ideal section through contacts
between close-packed spherical particles of powder. The former workers
found the original boundaries between wires to be surprisingly
permanent and to remain generally visible long after the grains within the
wires were of the same order of cross section as the wires. The rounding
of the "triangular" spaces between wires depended very markedly on
wire diameters. Thus at 1000°C the pores between 0.005 cm diameter
wires became circular in section in 6 hours, but between 0.013 cm
diameter and larger wires they remained clearly triangular. The fifth
power of the width of the junctions between wires was roughly
proportional to the time of heating and also to the square of the diameter of
the wire. Contraction of pore area was slower than that occurring in
powders. ALEXANDER and BALUFFI found a cavity to shrink so long as
a grain boundary ran into it; when this was no longer the case the
cavity would still grow more rounded in shape, but no further change
in area occurred. As boundaries disappear more quickly at higher
temperatures the pores did not shrink so much, although what change
there was took place more quickly. The same authors recorded that the
rate of sintering of silver powder is sensitive to the grain size and the
consequent concentration of boundaries.

The behaviour of synthetic pores consisting of holes 0.016, 0.022 or
0.026 cm radius and 1 cm long drilled in small bars of copper, when these
were heated in vacuo, has been examined by POSTLETHWAITE and
SHALER 62; the specimens stood vertically while heating. The radii
fell to minimum values after times of 30 to 70 hours heating and rose
again; in some instances there was an initial increase before dropping to
the lowest value. Typical values of the radius (cm) are:

                 Time (Hours)
Temperature      0         10        30        40        60        100
1000°C           0.0260    0.0262    0.0263    0.0260    0.0254    0.0257
1050°C           0.0262    0.0261    0.0259    0.0266    -         -


KUCZYNSKI 50, 51 and KUCZYNSKI and ZAVARINE 53 studied sections
through the joints between round particles of copper, silver or glass and
flat surfaces of the same materials on which they had been heated
(Fig. 4). A boundary line at the interface generally remained visible
and, in some cases, it appeared to have moved slightly into a copper
particle which then stood raised on a slight pedestal above the flat
surface. However, as the particles were not strictly round in section it
is difficult to be certain of this. The sharp angles at points of contact
became rounded as the width of these interfaces grew; KUCZYNSKI
measured these widths. These experiments must have been troublesome,
as rounded particles had to be found and the sections had to be
taken through the centres of the junctions. The glass particles examined
were more nearly round than the metals and showed both the rounding
of angles and the build up of matter above the original flat surface.

In one of the sections published by KUCZYNSKI the flat surface of the
copper block appears to have a shallow "moat" around the interface;
this specimen had been heated for 4 hours at 700°C in hydrogen, the
copper particle having a radius of about 0.003 cm; however, the
available data cannot justify an assumption that this structure is usual
if, perhaps, commonly less apparent. KUCZYNSKI 52 obtained most
interesting sections through the junctions between a copper particle and
a nickel plane or a nickel particle and copper plane (Fig. 5). In each
case a very deep groove appears in the copper around the junction. The
formation of these grooves would be easy to understand if rapid surface
diffusion of the copper atoms might be assumed; the convergence of the
groove as it reaches deeper into the copper might also follow, for the
mouth of the opening must move out from the original point of contact
as the area of the weld increases. It is not apparent why any other
mechanism should give vacancies condensing to single large cavities
open to the surface of the copper but penetrating deeply into it.

A single layer of 0.03 cm radius copper particles was sintered by
DEDRICK and GERDS 17 using temperatures from 700 to 1025°C. The
widths of the interfaces between particles were measured and varied
from 0.0036 to 0.0122 cm in experiments with sintering times up to
100 hours, and it was found that the fifth power of the ratio of this
width to the diameter of the particles gave a straight line which passed
through the origin if plotted against sintering time. SHALER 71 has also
examined the growth of the junctions between graded copper spheres
0.15 to 0.20 cm in diameter. These were first annealed and then
arranged in contact in a single row and heated in hydrogen. SHALER
plotted the angle subtended at the centres of particles by the radii of
the junctions against sintering times; this gave curves with linear
portions for long sintering times but these did not extrapolate through
the origin.
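
The fifth-power relation can be made concrete with a short numerical
sketch. It assumes the form (x/d)⁵ = kt, where x is the width of the
junction, d the particle diameter and t the sintering time; the rate
constant k below is hypothetical, chosen only so that x/d reaches 0.2
after 100 hours (the largest width quoted, 0.0122 cm on particles
0.06 cm in diameter, corresponds to a ratio of about 0.2).

```python
import math

def neck_ratio(t_hours, k):
    """Junction-width to particle-diameter ratio from (x/d)**5 = k * t."""
    return (k * t_hours) ** 0.2

# Hypothetical rate constant: x/d = 0.2 after 100 hours
k = 0.2 ** 5 / 100.0

times = [1.0, 10.0, 100.0]
ratios = [neck_ratio(t, k) for t in times]

# On logarithmic axes the law is a straight line of slope 1/5
slope = (math.log(ratios[-1]) - math.log(ratios[0])) / (
    math.log(times[-1]) - math.log(times[0])
)
for t, r in zip(times, ratios):
    print(f"t = {t:6.1f} h   x/d = {r:.4f}")
print(f"log-log slope = {slope:.3f}")
```

A consequence of so high a power is that growth of the junction is very
slow at long times: a tenfold increase in sintering time widens the
junction by a factor of only 10^(1/5), or about 1.6.
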

The rate of sintering of a material is generally very sensitive to the
nature of the atmosphere. Metal powders commonly reach any given
density at a lower temperature or, at a given temperature, in less time
in hydrogen or other reducing atmosphere than in argon or vacuum.
Thus at 980°C copper reached the same density in about 2 hours in
hydrogen as after 16 hours in a "vacuum" of 0.5 mm Hg. In the case
of beryllium, HAUSNER and PINTO 35 found that after pressing beryllium
at 2.1 × 10⁶ g/cm² and sintering for 3 hours at 1200°C it had a density
of 1.14 g/cm³ when the sintering was in argon, but of 1.66 g/cm³ when
it was done in air under a pressure of 0.0005 mm Hg; a less marked
difference was found when using specimens which had been subjected to
higher pressures. A similar sensitivity of refractory oxides, such as
beryllium oxide, to the atmosphere in which they are sintered may, it
has been suggested, be due to easier diffusion following the production
of vacant oxygen sites by surface reaction with the atmosphere. Zinc
oxide and titanium oxide depart from stoichiometry in air at sintering
temperatures, but alumina does not; the first two sinter at two-thirds
of the absolute melting point, the latter at about 0.85 of the melting
point.

LIKHTMAN 56 attributed the formation of small interparticle bridges
on heating copper at very low temperatures to mobile copper atoms
formed as the surface oxide is reduced, or to evaporation and
condensation of the metal. He observed that in sealed-off evacuated apparatus
in which copper was heated to 300 or 400°C, visible films of copper
formed on liquid nitrogen traps.

The rate of sintering is affected by powder particle size. Fine powders
usually sinter at rather lower temperatures than coarse ones, although,
as the finer powder may give the less dense pressing, it may still be the
more porous metal after a given sintering treatment. Uniformly sized
powder does not sinter as rapidly as a graded mixture; results for
electrolytic iron powders pressed at 4.6 × 10⁶ g/cm² (33 tsi) and sintered
for 4 hours at 825°C in hydrogen illustrating these points are given by
LIBSCH, VOLTERRA and WULFF 55:

Powder Size (Sieve Mesh)                      Tensile Strength    Density
                                              (10⁶ g/cm²)         (g/cm³)
(A) 100-200                                   2.01                7.20
(B) 200-325                                   1.86                7.06
(C) 325-500                                   2.11                6.90
4 parts (A) + 1 part each of (B) and (C)      2.18                7.27

Fig. 4. Copper sphere sintered to a plane
copper surface by heating in hydrogen for
one hour at 900°C. Magnification × 770

Fig. 5. Sections through: left, a copper particle sintered
to a nickel plane by 2 1 hours heating at 950°C in hydrogen;
magnification × 770. Right, a nickel particle sintered
to copper by 48 hours heating at 1020°C in hydrogen;
magnification × 220. Black areas are pores, not rounded
in nickel (KUCZYNSKI 52)




Gold of such fine particle size as 10⁻⁶ cm sinters at 150°C, as shown by
changes in its resistivity and in its infra-red reflectance which, it is
shown, are related (HARRIS, JEFFRIES and SIEGEL 32).

These very low sintering temperatures were found by using particles
of gold freshly made by condensation of the vapour; if the material
were first "stabilized" by temperatures below 80°C it sintered less
readily. It was suggested that the particles are in an unstable state when
freshly prepared and that stability follows a diffusion of vacant sites to
the surface; an activation energy for this stabilization process of
23,500 ± 1300 cal/g atom between 50 and 70°C was found.

KALISCHER 44 gave expansion-contraction curves for annealed and
unannealed nickel powder compacts when heating in hydrogen at
13.3°C per minute. The unannealed powder began to contract at a lower
temperature and shrank more than the annealed powder.

WARREN and LIBSCH 79 examined the sintering of 50/50 iron/cobalt
mixtures. A specimen heated rapidly to 1400°C and sintered had more
pores and larger pores than a similar specimen first heated between 800
and 1400°C before sintering. DELISLE 18 obtained some indication that
loose iron powder sintered less readily if annealed than if strained.

Similarly, pressing a powder causes sintering to occur more readily and
leads to a lower porosity after any given sintering treatment. Typical
results are those for molybdenum obtained by GRUBE and SCHLECHT. 31
After sintering for 3 hours in hydrogen they obtained the following
densities:

Pressure (kg/cm²)                 0       1000    2000    7000
Density (g/cm³) after 1000°C      2.6     4.6     5.6     7.6
Density (g/cm³) after 1700°C      6.5     9.4     9.5     10.0


Slightly greater porosity found in some metals after pressing at very
high pressure is probably due to expansion by enclosed gas. The effects
of pressure on several metals are reported and surveyed by BALKE. 6

The application of pressure during sintering has a very marked effect.
LAURENT and EUDIER 54 measured the widths of the junctions between
0.06 cm diameter copper spheres and plane surfaces after heating in
Kuczynski-type experiments. On heating for 100 minutes at 1050°C in
cracked ammonia a junction with a diameter of 0.014 cm was formed;
if a load of 10 g pressed the sphere on to the plane the same heating
gave a diameter of 0.036 cm. Loads reaching up to this value are
produced by the pressures used in technical hot pressing practice.

Copper was pressed cold under a pressure of 8.8 × 10⁵ g/cm²
(12,500 psi) to make compacts which were then sintered in hydrogen
while under loads of up to 1.16 × 10⁴ g/cm² by RHINES and CANNON. 64
No flattening of pores could be discovered after sintering at 1000°C.
Density increased more rapidly on loading, and in such a way that the
logarithm of the time required to reach any chosen density was related
linearly to the stress used. Typical results for a density equal to 95 per
cent of the bulk density of copper are:

Time (Hours)            38.0      27.2      14.5      4.0
Stress (g/cm²)          0         700       2100      4900
log(t₀/t)/stress        -         0.0002    0.0002    0.0002

Here t is the time to reach the density with the specimen under stress
and t₀ the time when it is under no load. The function in the third row
is constant for a linear relationship.
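
The constancy of the third row can be verified directly from the first
two. The short check below takes the 38.0-hour entry as the time for the
unloaded specimen and uses logarithms to base 10; both are assumptions,
though they are the ones that reproduce the tabulated figure.

```python
import math

# (stress in g/cm^2, hours to reach 95 per cent of bulk density)
results = [(0, 38.0), (700, 27.2), (2100, 14.5), (4900, 4.0)]

t_unloaded = results[0][1]

# log10(t0 / t) divided by stress, for each loaded specimen
ratios = [math.log10(t_unloaded / t) / stress for stress, t in results[1:]]

for (stress, t), ratio in zip(results[1:], ratios):
    print(f"stress = {stress:5d} g/cm^2   t = {t:5.1f} h   ratio = {ratio:.2e}")
```

All three ratios lie close to 2 × 10⁻⁴ cm²/g, so that on these figures
each additional 1000 g/cm² of load shortens the time to reach the chosen
density by a factor of about 10^0.2, or roughly 1.6.
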

Any given load produces a constant proportional increase in rate for
all times of sintering. A "densification parameter" β = (q_t − q_p)/q_t,
where q_t and q_p are the porosities of specimens given the same sintering
treatment unloaded and loaded, respectively, is independent of time for
each load. Loading within the range studied does not, then, introduce
a fresh mechanism into the processes of sintering. The effect of stress
on sintering found in this work is, as RHINES and CANNON point out,
similar to that of stress on creep rate.

It is now generally assumed that reduction of surface area and so of
surface energy provides the driving energy for sintering. The energy so
provided is small; early speculations have considered other possibilities
but it seems that these can make only incidental contributions to the
cause of sintering. The surface energy of copper powder with particles
of 0.005 cm radius would be about 2 × 10⁻² cal/g (0.1 joules/g). If
copper powder were compacted by hot pressing, a pressure of about
10⁴ g/cm² (200 lb/in.²) would be required to supply about the same
energy to the specimen.

The energy introduced into the powder particles by cold pressing and
made available on heating has also been suggested as a factor in
sintering. BALSHIN 7 supposed that recrystallization occurs by a
re-organization following cold work and that individual atoms diffuse to nuclei
which grow at the junctions between particles. The fact that the
greater part of sintering usually occurs at times and temperatures
beyond those needed for full annealing, and that much densification
takes place after this anneal, makes much action by recrystallization
improbable.

SAUERWALD, 66 in an early speculation, suggested that at some charac- 
teristic temperature which is about three-quarters of the absolute 
melting point and not affected by pressing, an interchange of atoms 
occurs between particles and leads to grain growth. Raising the tem- 
perature should then aid adhesion by increasing atom mobility and by 
smoothing away surface irregularities. 

SMITH, 74 in another early speculation, suggested that sintering might
depend upon changes in structure from amorphous to crystalline or
from one crystal form to another, these presumably being supposed to
free atoms for individual movement. SMITH's own determinations of the
lowest temperatures at which sintering of unpressed powders of
different metals could be detected by prodding with a wire hardly support
the view; he detected sintering in iron at 250°C, in copper at 250°C,
silver at 180°C and molybdenum at 800°C, but none in manganese
or tin.

Surface tension at a pore might induce local flow or surface diffusion,
but density changes require flow away from the pores at positions
where any direct effects due to surface tension are "diluted" away.
Considerable interest now centres on values of the surface tensions of
solids and of the energies of grain boundary areas. Recent work has
been reviewed by FISHER and DUNN. 24 In one method of determining
the surface energy of solids, weights are suspended from foils or wires
which are heated, and the weight at which no change in length occurs is
determined. This is the force required to balance the effect of surface
and grain boundary tensions. UDIN 77 has shown that the grain boundary
effect in such experiments is not, as earlier workers supposed, trifling,
but that it can be dealt with in wires of "bamboo-like" structure, that
is, having grain boundaries normal to their axis and occupying the full
cross section. If W is the load giving no change in length, UDIN puts

\[ W = \pi R \gamma - N \pi R^{2} \gamma_{B} \qquad (1) \]

where γ is the surface tension, R is the radius of the wire, N the number
of grains per unit length and γ_B the grain boundary energy. He believes
that γ_B ≈ γ/3 and thus deduces from the experimental results of UDIN,
SHALER and WULFF 78 that copper has a surface tension of
1650 ± 100 dynes/cm and a temperature coefficient of 0.55 dynes/cm/°C.
By the same procedure values of 1140 ± 90 dynes/cm for silver 26
between 875 and 932°C and of 1400 ± 65 dynes/cm for gold 12 over the
range 1017 to 1042°C have been given. The atmosphere appears to have
a real effect; the above data were obtained in helium; one run in air
gave a value for gold of 1207 ± 75 dynes/cm.
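
The way in which equation (1) yields a surface tension may be sketched
as follows. With γ_B taken as one-third of γ, the balance
W = πRγ − NπR²γ_B can be solved for γ once the zero-creep load W has
been found. The wire radius and grain count used below are hypothetical
values chosen for illustration, not measurements from the work cited,
and c.g.s. units are assumed throughout.

```python
import math

def surface_tension(load_dyn, radius_cm, grains_per_cm, gb_fraction=1.0 / 3.0):
    """Solve W = pi*R*gamma - N*pi*R**2*gamma_B for gamma,
    taking gamma_B = gb_fraction * gamma."""
    return load_dyn / (
        math.pi * radius_cm * (1.0 - grains_per_cm * radius_cm * gb_fraction)
    )

# Hypothetical bamboo-structured wire
R = 0.0064        # wire radius, cm
N = 50.0          # grains per cm of length
gamma_in = 1650.0 # dynes/cm, the value quoted for copper

# Zero-creep load implied by equation (1) for these values
W = math.pi * R * gamma_in - N * math.pi * R ** 2 * (gamma_in / 3.0)

gamma_out = surface_tension(W, R, N)
gamma_no_gb = W / (math.pi * R)  # result if the boundary term is ignored
print(f"W = {W:.2f} dynes, gamma = {gamma_out:.0f} dynes/cm")
print(f"ignoring grain boundaries: {gamma_no_gb:.0f} dynes/cm")
```

For a wire of these assumed dimensions, neglect of the boundary term
lowers the apparent surface tension by about one-tenth, which shows why
the grain boundary effect cannot be treated as trifling.
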

By another method BAILEY and WATKINS 5 obtained values for copper.
They prepared grooves along the grain boundaries of copper by thermal
etching at 800 to 900°C and measured the angles at the bottom of these
grooves between their sides. This angle was measured on specimens
which had been heated in argon (φ₁), and on others heated in argon and
lead vapour (φ₂), and the contact angle (φ₃) between copper and molten
lead was also measured. By balancing surface tension forces one obtains
the relation

\[ \gamma_{Cu/argon} = \frac{\cos(\phi_{2}/2)}{\cos(\phi_{1}/2)} \left( \gamma_{Cu/Pb} + \gamma_{Pb/vapour} \cos\phi_{3} \right) \qquad (2) \]

The quantities on the right of this are known or measured in the
experiments. Results again show that the atmosphere has an effect:

800°C in hydrogen     1750 dynes/cm
900°C in hydrogen     2000 dynes/cm
900°C in argon        1600 dynes/cm

Results from these two methods are in general agreement. RYMER and
BUTLER 65 calculated a quite different value of 500 dynes/cm for gold in
order to explain the anomalous surface lattice spacings measured by
electron diffraction.

Considerations of surface energy show that a pore should close up if 
the surface energy of the material is more than half of that of the 
grain boundary which would be formed ; for copper, it appears to be 
two or three times as great. 

Sintering may be considered as a simple rate process. If the logarithms
of the times to reach a certain stage in the process, that is for some
property to reach some special value, are plotted against the reciprocal
of the temperature a straight line is obtained if the Arrhenius equation is
applicable. For such a rate process a definite activation energy exists.
HÜTTIG 39 measured the electrical resistivity of electrolytic copper
powder after various times at different temperatures in hydrogen and
obtained fair linear plots of this type, taking the times required to reach
selected resistivities. He obtained an activation energy of 56,000 cal/g
atom. DUWEZ 21 treated porosity data for copper in the same way and
obtained an activation energy of 80,000 cal/g atom for sintering in
hydrogen, but for the same copper sintered in vacuum he did not obtain
the required straight line plots of log time against 1/T. This would
seem to mean that more than one activation process is involved in the
sintering. The activation energy is shown by DUWEZ's curves to fall as
the sintering temperature is increased.
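
The derivation of an activation energy from such a plot may be sketched
numerically. If the time t to reach a chosen stage varies as
exp(Q/RT), two observations at absolute temperatures T₁ and T₂ give
Q = R ln(t₁/t₂)/(1/T₁ − 1/T₂). The times used below are synthetic,
generated from an assumed Q of 56,000 cal/g atom purely to show that the
procedure recovers it; they are not experimental data.

```python
import math

R_GAS = 1.987  # gas constant, cal per g atom per degree

def activation_energy(temp1_c, time1, temp2_c, time2):
    """Activation energy (cal/g atom) from the times needed to reach the
    same stage of sintering at two temperatures, assuming t ~ exp(Q/RT)."""
    t1_abs = temp1_c + 273.15
    t2_abs = temp2_c + 273.15
    return R_GAS * math.log(time1 / time2) / (1.0 / t1_abs - 1.0 / t2_abs)

def synthetic_time(temp_c, q, prefactor=1.0e-10):
    """Synthetic time to reach a stage; the prefactor is arbitrary."""
    return prefactor * math.exp(q / (R_GAS * (temp_c + 273.15)))

Q_ASSUMED = 56000.0
t_800 = synthetic_time(800.0, Q_ASSUMED)
t_900 = synthetic_time(900.0, Q_ASSUMED)

q_found = activation_energy(800.0, t_800, 900.0, t_900)
print(f"time ratio t(800)/t(900) = {t_800 / t_900:.2f}")
print(f"recovered Q = {q_found:.0f} cal/g atom")
```

Where more than two temperatures are available the same quantity is
obtained from the slope of the best straight line through the points; a
curved plot, such as DUWEZ found in vacuum, gives a different Q for each
pair of temperatures.
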

FRENKEL 25 suggested that sintering solids flow as Newtonian viscous 
liquids, that is, so that the strain rate is linearly related to stress, surface 
tension being the driving force. The viscous flow is supposed to arise 
from the diffusion of individual vacant atom sites and is thus distinct 
from plastic flow which is due to the orderly slipping of atomic layers in
a crystal. He compares the sintering of two spheres with the merging
of two drops of liquid. Equating the decrease of surface energy
released to the energy dissipated as a result of viscous flow, he deduces
that the area of contact, $\pi r^2$, should increase at a steady rate, that
is in proportion to the time $t$. FRENKEL also found that gas pressure in
a pore will stop its shrinkage when this equals the pressure associated
with surface tension [expression illegible in source], $X$ being the
original pore radius and $P$ the gas pressure. He showed
how an activation energy may be derived as already described, and
believes that it is the activation energy of self diffusion. SHALER and
WULFF 72 used FRENKEL'S model and calculated an activation energy
of 85,000 cal/g atom for copper; this value is rather higher than those
obtained directly for diffusion (60,000 cal/g atom). For the viscosity
coefficient of a metal, $\eta$, FRENKEL gave the expression $\eta = kT/Da$,
where $k$ is the Boltzmann constant, $T$ is the temperature, $a$ the lattice
constant and $D$ the self-diffusion coefficient. Taking published values
for $D$ this gave for copper at 1000°C $\eta = 3.1 \times 10^3$ poises, a
viscosity roughly comparable with Golden Syrup at room temperature!
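
As a rough check on the order of magnitude, FRENKEL'S expression can be
inverted to see what self-diffusion coefficient the quoted viscosity
implies. The sketch below reads the expression with the Boltzmann
constant in CGS units, which is the reading that gives poises; the
lattice constant of copper (3.61 × 10⁻⁸ cm) is a value assumed here, not
one given in the text, and the function names are hypothetical.

```python
K_BOLTZMANN = 1.380649e-16  # erg/K (CGS, so that the result is in poises)


def frenkel_viscosity(temp_k, diffusion_cm2_s, lattice_cm):
    """Viscosity in poises from eta = kT / (D a)."""
    return K_BOLTZMANN * temp_k / (diffusion_cm2_s * lattice_cm)


def implied_diffusion(temp_k, viscosity_poise, lattice_cm):
    """Self-diffusion coefficient (cm^2/s) implied by a given viscosity."""
    return K_BOLTZMANN * temp_k / (viscosity_poise * lattice_cm)


LATTICE_CU = 3.61e-8        # cm; assumed lattice constant for copper
TEMP_K = 1000.0 + 273.15    # 1000 deg C

d_implied = implied_diffusion(TEMP_K, 3.1e3, LATTICE_CU)
eta_check = frenkel_viscosity(TEMP_K, d_implied, LATTICE_CU)
print(f"implied D = {d_implied:.2e} cm^2/s, eta = {eta_check:.2e} poises")
```

The implied coefficient is of the order of 10⁻⁹ cm²/s under these
assumptions.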

NABARRO 60 has criticized FRENKEL'S treatment in that the movements
of lattice vacancies do not provide a mechanism for viscous flow.
Thus if a vacancy in a row of atoms moves from left to right the atoms
in that row all move a distance of one vacancy from right to left relative
to the rows of atoms above and below, but these two rows remain
unmoved relative to each other, which means that there has been no
movement of one large part of the crystal relative to the other.
FRENKEL'S treatment appears to require such a movement. NABARRO showed
that, to be viscous, a crystal must contain mosaic structure or other
imperfection in addition to vacant sites and its coefficient of viscosity
will be about $10^8$ times that given by FRENKEL'S expression; this brings
the value for copper at 1000°C to the order of that of pitch at 0°C.

ALEXANDER, KUCZYNSKI and DAWSON* considered, from FRENKEL'S
point of view, data for the creep of gold wires (0.001 in. diameter) at
970°C. They plotted strain against time for various loads and, taking
the strain rate from the most linear part of the curves, plotted this
against stress, obtaining a straight line. This meant that under these
limited circumstances flow was viscous, and its coefficient of viscosity
came out as $2.36 \times 10^{12}$ poises; it also gives a surface tension
for gold of 1280 dynes/cm. The linear parts of the strain against time
curves were given by the data for longer times; the fact that the early
parts are steeper and depart from linearity implies that during sintering
flow is not at first purely viscous and that the strain rate is higher
than for the apparently viscous conditions.

Using the same data and plotting initial strain rate against stress
gave a curve consisting of two straight lines intersecting at what should
be a critical shear stress; it has a value of 7 kg/mm$^2$ (the measured
value at room temperature is 9.2 kg/mm$^2$). The part of the curve below
this stress corresponds to viscous flow and a viscosity of
$1.4 \times 10^{12}$ poises. Above the critical stress initial strain rate
is again linear with stress but it increases more quickly and the
coefficient of viscosity is $1.5 \times 10^{12}$ poises. It is suggested
that at low stresses viscous flow occurs by vacancy migration but for
higher stresses a formula obtained by NABARRO 60 and connecting strain
rate with the size of a mosaic block, or other region over which the
lattice is coherent, gives a strain rate in agreement with the experiment,
assuming mosaic blocks to be $10^{-4}$ cm across.

Fig. 6. CLARK and WHITE'S model of the sintering of two spherical
particles; the shaded section is through the "lens" forming the
junction 14 (figure label: $x = \cos\theta$)

SHALER'S 71 measurements of the widths of junctions between particles
should, when plotted against the sintering time, give a straight
line through the origin if FRENKEL'S theory were correct. In fact the
curves bend steeply towards the time axis during the first hours of
sintering and, although they may become linear over long times (over
20 hours), the linear parts do not extrapolate to the origin. SHALER
thought the form of these curves to be evidence for two mechanisms;
he suggested an initial rapid flow by creep or slip, with only the slower
viscous flow operating at longer times.

CLARK and WHITE 14 have examined the possibility of viscous flow
of a layer of material of thickness $h$ to form a lens between two spherical
particles. Their model is shown in Fig. 6. It is assumed that these
particles remain spherical and that flow or diffusion maintains this form
while matter transfers to the junctions and the volumes of the particles
decrease. Clearly this can be true only of the early part of sintering and
even then this is not the picture that metallography has given of the
changes in metal compacts. CLARK and WHITE made experiments with
refractory oxides which perhaps come nearer to their model.

Surface tension ($\gamma$) will impose a positive hydrostatic pressure on
material in the spherical particles and, because of the concave surface
of the lens, a negative hydrostatic pressure within a lens. The pressure
difference acting from sphere to lens will be

$$p = \frac{\gamma}{R}\left(2 + \frac{1}{1 - x} - \frac{1}{\sqrt{1 - x^2}}\right) \qquad (3)$$

It is supposed that flow under this pressure is in shear and occurs
viscously by activated jumps of moving units. It is assumed that this
flow occurs primarily in a thin surface film of the almost cylindrical
lens and that most of the pressure drop at the slow rate of flow occurring
will be over a short length $l$ on either side of the plane AA (Fig. 6) to
which all the moving material must converge.

Flow into one lens of volume $v$ can then be written as

[expression for $dv/dt$ illegible in source]

Here $u_r$ is the velocity of flow where radius $r_1$ has the value $r$ and
$r_0$ is the radius of the stationary part of the lens, that is
$r_0 = r_1 - h$. From the equation CLARK and WHITE obtain the change of
volume shrinkage $V$ with time as

[equation (4) illegible in source]

where $V$ is the total volume of the lenses in 1 gram of specimen, $q_0$ the
initial pore volume in 1 gram, $n$ the number of contacts with neighbours
made by each particle and $B = \gamma h^3/\rho\eta l R^3$. With varying
temperatures the most significant term is $\eta$ which, for an activated
flow process, varies so that $B \propto e^{-E/RT}$, where $E$ is an energy
of activation for the flow process.

CLARK and WHITE tested their ideas about a sintering mechanism 
by obtaining experimental shrinkage-time curves and comparing them 
with calculated ones. 

So far the assumption has been latent in this model that lenses remain
distinct from each other. In fact they will merge at some value of $x$
which will depend upon the packing arrangement of the particles; if
each particle has eight neighbours this will be when $x$ is about 0.8.
Actually CLARK and WHITE found that curves derived from their
equations agreed with experimental data when taken beyond this
expected limit to the range of their applicability.

In this way CLARK and WHITE obtained curves giving good fit with
experiments made on a glass at temperatures of 598, 626, 648 and
676°C. This material had a density, $\rho$, of 2.54 g/cm$^3$; $q$, the pore
volume per gram, was 0.21 cm$^3$; the surface tension $\gamma$ was about
400 dynes/cm and at 676°C the viscosity was about $10^7$ poises; $n$ was
assumed to be 8. Experimental results obtained with alumina or with
beryllium oxide did not agree with curves calculated on this basis of
simple viscous flow. For these materials a Bingham type of flow with a
critical shear stress $\tau_c$ was considered; this gave rates of strain
proportional to the stress due to surface tension when reduced by
subtracting the quantity $\tau_c$. This gave a modified shrinkage curve:

[equation illegible in source]

the term in curly brackets being a dimensionless number.

Curves were calculated from this which agreed well with experiments
made on $\alpha$- and $\gamma$-alumina and a beryllium oxide.

The quantity $B = \gamma h^3/\rho\eta l R^3$ varies with temperature and in
this the change in $\eta$ is the most dominant factor. This varies with
temperature according to the Arrhenius activation energy relationship
$B \propto e^{-E/RT}$, where $E$ is the activation energy for the flow
process. It follows that log $B$ should give a straight line when plotted
against $1/T$. For alumina CLARK and WHITE found this true and it also
held roughly for magnesia. For glass, however, the relationship obtained
gave a curve which implied a rise in the activation energy with falling
temperature. The activation energies for viscous flow of glass do
vary in this way. Values of the activation energy obtained were
87,000 cal/mole for alumina and, most probably, 43,000 cal/mole for
magnesia.

It appears probable from the metallographic work on the "sintering"
together of wires, or that of spherical particles to planes, that the
particle radii all vary little throughout a large part of the changes
during which contact radii grow. This seems generally true until, in a
powder, the pores would be sealed off, and this commonly occurs when the
porosity is about 15 per cent of the total volume. In deriving their
shrinkage time relationships CLARK and WHITE assumed the particle
radii to decrease uniformly. Their further assumption that the
thickness, $h$, of the moving layer remains constant throughout sintering
would appear open to question in the cases of solids giving Bingham
flow. The assumption would imply that as the lens grows the thickness
over which stress exceeds the yield value is not dependent upon the
radii of the lens, $r_1$ and $r_2$.

MACKENZIE and SHUTTLEWORTH 67 have considered viscous or plastic
flow of a mass containing uniformly distributed sealed-off spherical
pores. Surface tension on the pores, all of radius $X$, is considered
equivalent to an external pressure of $+2\gamma/X$ applied to a solid
which is assumed to be incompressible, so that the effect is uniform
through any size or shape of sintering material. The effective pressure
in copper would be that of an applied load of one atmosphere. An equation
of state, that is an equation linking strain rate with stress at constant
temperatures, was sought, and again viscous flow and Bingham flow
were examined. In this work no atomic mechanism was considered, but
by equating energy from reduction of surface area with energy dissipated
in flow, equations for shrinkage with sintering time were again
developed.

MACKENZIE and SHUTTLEWORTH do, however, argue that flow must
arise from a macroscopic mechanism in which atoms move in assemblies
and not from any process in which they move as individuals, which,
they say, would inevitably be too slow to explain the observed rates. To
illustrate this they take the excess concentration of vacant lattice
sites in copper at 1000°C due to pores of $10^{-4}$ cm radius, in a
specimen giving an average distance of $10^{-1}$ cm from the surface, and
show that shrinkage due to volume diffusion would then be at a rate of
about $10^{-6}$/hour. If sintering were by volume diffusion, however, it
should occur from the surface inwards as diffusion of vacant sites to the
surface would not occur from any pore while other pores screened
the surface from it. While such an effect has been recorded 61 there
is not sufficient evidence to show that it is a general phenomenon
of sintering; plastic mounting of a specimen for metallographic
examination is itself able to develop what appears as this effect.

The first step in this derivation of shrinkage-time relationships is to
consider the shrinkage of one pore of radius $X_1$ surrounded by a spherical
shell (radius $X_2$) of non-porous matter, the whole placed in a mass of
uniform matter (that is, the rest of the sintering specimen). The stress
in this mass being everywhere a continuous hydrostatic pressure and the
overall density of this pore with its shell being that of the whole mass,
the radial strain rate at radius $X$ is

$$\frac{du}{dX} = -\frac{2u}{X}\cdot\frac{X_1^2}{X^2}$$

where $u$ is the radial velocity at the surface of the pore. Derivation of
this takes from the theory of viscosity the relationship
$dG/dt = 3(du/dX)^2\eta$, where $G$ is the energy. If there were only one
isolated pore in a specimen the outer radius of the shell would be
infinite. By putting a limiting thickness to the shell, that is giving
$X_2$ a finite value, the interaction of other pores with one under
consideration is taken into account. This treatment is strict for small
values of $X_1/X_2$ but it is not clear how accurate it is when this
quantity is large, that is when the pores are close together. This means
that MACKENZIE and SHUTTLEWORTH'S expressions should hold more rigidly
during the later stage of sintering.

A solid with Newtonian viscosity has strain rate proportional to
stress and viscosity $\eta$ independent of strain, so that the radial
velocity is $u = \dfrac{\gamma}{2\eta}\cdot\dfrac{1}{\rho}$, where $\rho$
is the density relative to that of the massive material. For an isolated
pore $\rho = 1$ and the factor $1/\rho$ represents the effect of all other
pores on the rate of sintering of one pore. In a powder compact $\rho$ is
generally more than 0.5, so that a pore in a compact will not close up at
rates greater than twice that of an isolated pore.

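The rate of density change of a Newtonian solid with time during an
isothermal sintering run is given in terms of the number of pores, $m$,
in a unit volume, the surface tension and viscosity by:

$$\frac{d\rho}{dt} = \frac{3}{2}\left(\frac{4\pi}{3}\right)^{1/3}\frac{\gamma m^{1/3}}{\eta}(1-\rho)^{2/3}\rho^{1/3} \qquad (6)$$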
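
The course of density with time for the Newtonian case can be sketched
by integrating the rate law numerically. The sketch assumes the rate law
in the form usually quoted from MACKENZIE and SHUTTLEWORTH,
dρ/dτ = (3/2)(4π/3)^(1/3) (1 − ρ)^(2/3) ρ^(1/3), with the reduced time
τ = γ m^(1/3) t/η; the starting density of 0.7, the step count and the
function names are arbitrary choices made for illustration.

```python
import math

PREFACTOR = 1.5 * (4.0 * math.pi / 3.0) ** (1.0 / 3.0)


def rate(rho):
    """d(rho)/d(tau) for a Newtonian solid; zero once fully dense."""
    pore_fraction = max(1.0 - rho, 0.0)
    return PREFACTOR * pore_fraction ** (2.0 / 3.0) * rho ** (1.0 / 3.0)


def densify(rho_start, tau_end, steps=2000):
    """Fourth-order Runge-Kutta integration in reduced time."""
    h = tau_end / steps
    rho = rho_start
    history = [rho]
    for _ in range(steps):
        k1 = rate(rho)
        k2 = rate(min(rho + 0.5 * h * k1, 1.0))
        k3 = rate(min(rho + 0.5 * h * k2, 1.0))
        k4 = rate(min(rho + h * k3, 1.0))
        # Clamp at the theoretical density so round-off cannot overshoot it.
        rho = min(rho + h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0, 1.0)
        history.append(rho)
    return history


history = densify(0.7, 2.0)
print(f"relative density at tau = 1: {history[1000]:.4f}")
```

Because the curve depends only on the reduced time, one calculated curve
serves for any combination of surface tension, viscosity and pore number.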



For a Bingham solid of critical shear stress $\tau_c$ the viscosity at any
moment is dependent on the strain, and the shrinkage of a pore,
$dX/dt$ (radial velocity of pore surface inwards) was found to be

$$-\frac{dX}{dt} = \frac{\gamma}{2\eta_\infty}\,\cdots$$

(the remaining factor is illegible in the source), where $\eta_\infty$ is
the ultimate value to which $\eta$ tends.

With such materials a pore will not close unless

$$\frac{2\gamma}{X} > \sqrt{2}\,\tau_c \ln\frac{1}{1-\rho};$$

this means that there is a critical size of pore such that material
containing only larger pores will not increase in density.

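The rate of increase in density is given by the equation:

$$\frac{d\rho}{dt} = \frac{3}{2}\left(\frac{4\pi}{3}\right)^{1/3}\frac{\gamma m^{1/3}}{\eta_\infty}(1-\rho)^{2/3}\rho^{1/3}\left[1 - a\left(\frac{1}{\rho}-1\right)^{1/3}\ln\frac{1}{1-\rho}\right] \qquad (7)$$

where $a = \sqrt{2}\left(\dfrac{3}{4\pi}\right)^{1/3}\dfrac{\tau_c}{2\gamma m^{1/3}}$.
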
From this equation values of density can be calculated and plotted
for any selected value of $a$ against a reduced time
$\gamma m^{1/3}(t - t_0)/\eta_\infty$. It is clear from the equation
defining $a$ that this quantity is zero for the case of Newtonian viscous
flow, that is when $\tau_c$ is zero, and that it increases with $\tau_c$;
this quantity $a$ also depends upon the number of pores per unit volume,
so that $d\rho/dt$ depends upon this quantity. From equation (7)
$d\rho/dt$ can be seen to be positive, that is, sintering with shrinkage
occurs, only when the expression in the square bracket is positive. This
is generally true if $a$ is small, but when $a > 0.89$ the value of
$d\rho/dt$ falls to zero as the square bracket term itself falls to zero,
or when $(1/\rho - 1)^{1/3}\ln\left(\dfrac{1}{1-\rho}\right) = 1/a$. If
pressing before sintering had given a density greater than 0.94 of that
of the massive material sintering could go on to the theoretical density.
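
The two figures quoted, 0.89 and 0.94, can be checked by locating the
maximum of the term (1/ρ − 1)^(1/3) ln(1/(1 − ρ)): densification stops
somewhere only if 1/a lies below that maximum. The following sketch does
this by a simple scan over ρ; the function names and the number of sample
points are arbitrary choices made here.

```python
import math


def bracket_term(rho):
    """(1/rho - 1)^(1/3) * ln(1 / (1 - rho)) for 0 < rho < 1."""
    return (1.0 / rho - 1.0) ** (1.0 / 3.0) * math.log(1.0 / (1.0 - rho))


def critical_point(samples=100_000):
    """Return (rho at the maximum, smallest a for which shrinkage can stop)."""
    best_rho, best_value = 0.0, 0.0
    for i in range(1, samples):
        rho = i / samples
        value = bracket_term(rho)
        if value > best_value:
            best_rho, best_value = rho, value
    return best_rho, 1.0 / best_value


rho_star, a_crit = critical_point()
print(f"maximum at rho = {rho_star:.3f}; critical a = {a_crit:.3f}")
```

The scan places the maximum close to a relative density of 0.94 and the
critical value of a close to 0.89, in agreement with the figures in the
text.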

The term $(1/\rho - 1)^{1/3}\ln\left(\dfrac{1}{1-\rho}\right)$ tends rapidly
to zero as $\rho$ tends to unity, which means that when pores are very
small or very far apart they will cease to close.

MACKENZIE and SHUTTLEWORTH calculated from strain experiments
on copper wires made by UDIN, SHALER and WULFF 78 that the critical
shear stress of copper is greater than $2.0 \times 10^5$ dynes/cm$^2$ at
1000°C, and from sintering experiments 73 that it is less than
$2.6 \times 10^5$ dynes/cm$^2$ at 850°C. At room temperature the value is
$10^7$ dynes/cm$^2$.

KUCZYNSKI 51 examined theoretically what the rate of growth of the
contact between a sphere and a plane surface on heating should be in
the cases of mechanisms based on viscous or plastic flow, on volume
diffusion, on surface diffusion and on evaporation and condensation.
He found by his experiments what power of the contact radius was
proportional to "sintering" time and compared these results with
predictions obtained from his considerations of mechanisms.

For an evaporation and condensation mechanism he takes the pressures
of vapour $p_1$ and $p_2$ over surfaces of radii $R$ and $r_2$ as given by
the Kelvin equation,

$$\ln\frac{p_2}{p_1} = \frac{2\gamma a^3}{kT}\left(\frac{1}{r_2} - \frac{1}{R}\right),$$

$a$ being the interatomic distance in the solid. The rate of movement of
matter at one temperature should be proportional to the pressure
difference $(p_1 - p_2)$, and as $R$ is effectively constant, $p_1$ does
not alter, and so the rate of movement of matter is inversely
proportional to $r_2$, that is the rate of movement $M = k'/r_2$, where
$k'$ depends on temperature and $R$. Expressing $r_2$ in terms of $r_1$,
the quantity $M$ was shown to be proportional to $1/r_1^2$, where $M$ is
now the rate of condensation on the area of the junction. Putting
$M \propto dr_1/dt$, KUCZYNSKI found that the cube of the junction radius
is proportional to the time, $r_1^3 \propto t$.
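
The step from the rate of condensation to the cube law is a single
integration, written out here for clarity. The constant $k''$ is a
symbol introduced only for this illustration; it collects the
temperature-dependent factors and is taken as constant during an
isothermal run, with the junction radius starting from zero.

```latex
\frac{dr_1}{dt} = \frac{k''}{r_1^{2}}
\quad\Longrightarrow\quad
\int_0^{r_1} r^{2}\,dr = k'' \int_0^{t} dt'
\quad\Longrightarrow\quad
r_1^{3} = 3k''\,t .
```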

Volume diffusion was considered in terms of the movement of vacant
lattice sites. Near the junction the radius of curvature is small; an
increase in volume here would reduce surface energy and could
occur if the concentration of vacant sites increased. This concentration
depends upon the curvature of nearby surfaces so that there will
be a concentration gradient of vacant sites between the junction
"lenses" and the rest of the sintering particles and vacant sites
will diffuse away from the junctions or atoms will come into
them. Kelvin's relationship may be written as
$\Delta C = \cdots\, e^{-E'/RT}$ (the coefficient is illegible in the
source), where $C$ is the concentration of vacant sites and $E'$ is the
energy required to produce one vacancy. If $D_v$ is the coefficient of
volume diffusion, that of diffusion of holes is $D' = D_v e^{E'/RT}$, and
combining these with Fick's equation [illegible in source] gave

$$r_1^5 \propto \frac{\gamma a^3 D_v}{kT}\,R^2\,t \qquad (8)$$



The case for surface diffusion has been examined by CABRERA 13 and
SCHWED 70 as well as by KUCZYNSKI. They have reached different
relationships between junction radius and time. The equilibrium
concentration of adsorbed atoms or of vacant sites at any point will
depend upon the curvature of the surface there. There will thus be
different concentrations at the junction and at surfaces of larger radii
and this will be such that atoms flow to the junction. The equilibrium
concentrations of adsorbed atoms or vacant sites on any surface may be
kept up by condensation from vapour or from growth dislocations
in the crystal surfaces.

CABRERA considered the surface as of two parts with uniform
concentrations of adsorbed atoms over each and a discontinuity at the
junction. According to SCHWED, KUCZYNSKI considered only a vacancy
concentration gradient in the neck. SCHWED argued that these models are
not adequate. Setting up an equation for diffusion past the junction A in
Fig. 7, and avoiding these restrictions but assuming the flow to be
balanced at any point by interchange of atoms between normal surface
positions and sites which steps in the surface enfold where they bend, he
finds that for small values of the radius $r_2$ the growth law is
$r_1^3 \propto t$; where $r_2$ is larger, the relation is
$r_1^5 \propto t$. Here $r_2$ is to be considered small or large if it is
much smaller or greater than a distance $(D_s t_s)^{1/2}/\pi$, where $t_s$
is the relaxation time for condensation of atoms on the surface. It is not
easy to estimate this distance with any certainty. Putting
$D_s = 10^{-5}$ cm$^2$/sec and $r_2 = 10^{-4}$ cm would imply a relaxation
time of $10^{-2}$ seconds as standing between the two sintering laws.

Fig. 7. Section through a spherical particle sintered to a plane surface
as in KUCZYNSKI'S experiments 51
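
The relaxation time quoted as separating the two growth laws follows
from setting the neck radius equal to the distance (D_s t_s)^(1/2)/π and
solving for t_s. The short sketch below repeats that arithmetic with the
two values given in the text; the function name is hypothetical.

```python
import math


def relaxation_time(radius_cm, surface_diffusion_cm2_s):
    """Solve radius = sqrt(D_s * t_s) / pi for the relaxation time t_s."""
    return (math.pi * radius_cm) ** 2 / surface_diffusion_cm2_s


t_s = relaxation_time(1e-4, 1e-5)
print(f"relaxation time: {t_s:.2e} seconds")
```

This gives a time of the order of 10⁻² seconds, as stated.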

KUCZYNSKI originally proposed a law $r_1^7 \propto t$ for surface diffusion
and CABRERA, amending the argument, gave $r_1^5 \propto t$, which would not
allow distinction between surface and volume diffusion.

These predicted relationships, all of the form $(r_1)^K \propto t$, may be
summarized as a table.

Sintering Mechanism              K
Viscous or plastic flow          2
Evaporation condensation         3
Volume diffusion                 5
Surface diffusion (SCHWED)       3 if $r_2$ is small
                                 5 if $r_2$ is large

According to SCHWED, atom or site replacement by equilibrium 
reaction alone gives a flow which leads to no net junction growth as 
atoms arrive or leave at equal rates by this means. 

KUCZYNSKI examined experimentally the behaviour of copper,
silver 51 and glass. 50 He sintered round particles on to flat surfaces and
then sectioned these and measured the radii of the junctions. This
must be a very difficult experiment. One of these specimens is shown in
Fig. 4. For particles of copper ($4 \times 10^{-3}$ cm radius) or of silver
($2 \times 10^{-2}$ cm radius) he obtained values of log $(r_1/R)$ which
gave linear plots against log $t$; the slopes, which give $K$, varied
between 4.6 and 5.4. This corresponds with his deduced value of 5 for a
volume diffusion mechanism.
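
The way an exponent K is read from such measurements can be sketched as
a straight-line fit of log time against log junction radius. The data
below are synthetic, generated from an assumed fifth-power law, and are
not KUCZYNSKI'S measurements; the function name and the numerical values
are hypothetical.

```python
import math


def fit_exponent(radii, times):
    """Least-squares slope of ln(t) against ln(r): K in r**K proportional to t."""
    xs = [math.log(r) for r in radii]
    ys = [math.log(t) for t in times]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    return sxy / sxx


# Synthetic junction radii (as a fraction of particle radius) and times.
times = [1.0, 2.0, 4.0, 8.0, 16.0]
radii = [0.1 * t ** (1.0 / 5.0) for t in times]

k_fit = fit_exponent(radii, times)
print(f"fitted exponent K = {k_fit:.2f}")
```

With measured radii the scatter in the fitted slope, such as the range
4.6 to 5.4 reported, indicates how sharply one mechanism can be
distinguished from another.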

With very much smaller copper particles ($5 \times 10^{-4}$ cm diameter)
KUCZYNSKI'S experiments gave a value of $K$ of about 6.5, which lies
closer to his prediction for a surface diffusion mechanism, but in the
light of CABRERA and SCHWED'S work this result is not easy to explain,
and in fact the result for very small copper particles is inconsistent with
the theory.

In his experiments with a glass KUCZYNSKI 50 used beads of c.
$2.5 \times 10^{-2}$ cm radius and obtained excellent straight-line plots
of $(r_1/R)^2$ against $t$. (The data tabulated in his paper contain some
misprints.) This is his relationship for a viscous flow mechanism of
sintering. KUCZYNSKI'S work is thus a very vivid indication that sintering
can be dominated by quite different processes and that the nature of the
material is an important factor in determining which are important.

MACKENZIE and SHUTTLEWORTH suggest that in KUCZYNSKI'S experiments
shear stress will be zero except just at the interface and consequently
any plastic movement of the bulk of the material is
impossible. The elimination of plastic or viscous flow would make it
possible for some other, and perhaps normally subsidiary, mechanism
to dominate the process.

There are very few data on the mechanical properties of metals under
small stresses at sintering temperatures. ALEXANDER, DAWSON and
KLING 3 have obtained creep curves on 0.0028 cm diameter gold wires
under small stresses at 920 to 1020°C. At such low stresses the
minimum strain (creep) rate was proportional to the stress; thus at
1020°C strain rate was linearly related to the stress up to about
$10^{-3}$ dynes/cm$^2$, at which stress the strain rate was about
$8 \times 10^{-6}$ per second. This result means that the flow observed
under these conditions was viscous; if viscous flow is possible under
such minute stresses it is surprising that KUCZYNSKI'S experiments
suggested diffusion as the mechanism of sintering under small stresses.
The explanation has been put forward 4 that in sintering experiments of
this type the flow occurs in the small neck of sharp radius
$r_2 = 10^{-4}$ cm, which is about the dimension of a mosaic block and
that, as KUCZYNSKI'S experiments showed, this flow is by volume diffusion.
In an aggregate of mosaic blocks, such as the gold wires, the creep
experiment suggests that volume diffusion of vacancies within the blocks
could lead to viscous flow of the specimens. Thus the same mechanism
could appear as a diffusion process in sintering and as a viscous flow of
the wires.

MURRAY and WILLIAMS 58 have considered the application of the
MACKENZIE and SHUTTLEWORTH equation, (7), to hot pressing, that is
when a pressure is applied at sintering temperature to a mass containing
closed pores. This pressure would require a term to be added to the
pressure due to surface tensions. They believe that for hot pressed
metals $(d\rho/dt)_{\rho,T}$ is proportional to $L^U$, where $L$ is
pressure and $U$ has a value normally between 3 and 2, but which falls as
$T$ rises. This power relationship could arise if a creep process were
involved in sintering, creep rate being proportional to a power of the
stress causing it. Pressure is not known to influence diffusion constants
and so should not affect sintering due to diffusion. However, pressure
alone will compact a metal and its effect would be added to that of any
other process.

Diffusion appears to be much faster along grain boundaries than
through grains, and as there is no indication that traces of impurity
affect bulk diffusion markedly, the difference cannot be due to
congregation at boundaries of foreign atoms. HOFFMANN and TURNBULL 38
diffused radioactive silver into the normal metal and by then taking
auto-radiographs showed the active material to be concentrated at
grain boundaries; its concentration fell away with depth in the silver.
In a rather similar study of diffusion of active silver into copper
ACHTER and SMOLUCHOWSKI 1 found the rate of penetration to increase
as the difference of orientation between the grains on either side of
the boundary increased. Values of lattice ($D_v$), boundary ($D_b$) and
surface ($D_s$) diffusion coefficients for the diffusion of silver and
thorium into tungsten, expressed as $D = D_0 e^{-E/RT}$, are collected
into the table:

                               D_v         D_b         D_s
Silver into tungsten
  D_0 (cm^2/sec)               0.895       0.025       0.16
  E (cal/g atom)               45,950      20,200      10,300
Thorium into tungsten
  D_0 (cm^2/sec)               1.0         0.74        0.47
  E (cal/g atom)               120,000     90,000      66,400
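
The tabulated constants can be turned into diffusion coefficients at a
chosen temperature with the Arrhenius form D = D_0 exp(−E/RT). The
sketch below does this for the first set of values in the table; it
assumes the pre-exponential factors are in cm²/sec and the energies in
cal/g atom, and the dictionary and function names are hypothetical.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mol K)

# (D_0, E) pairs for the first set of values in the table.
SILVER_CONSTANTS = {
    "volume": (0.895, 45950.0),
    "boundary": (0.025, 20200.0),
    "surface": (0.16, 10300.0),
}


def diffusion_coefficient(path, temp_c):
    """Arrhenius diffusion coefficient for the named path at temp_c (deg C)."""
    d0, energy = SILVER_CONSTANTS[path]
    return d0 * math.exp(-energy / (R_CAL * (temp_c + 273.15)))


for path in SILVER_CONSTANTS:
    print(f"{path:9s} D at 800 deg C = {diffusion_coefficient(path, 800.0):.2e}")
```

At 800°C the three coefficients differ by several orders of magnitude,
which is why the width of boundary needed to match volume transport
changes so strongly with temperature.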



The width of a band of silver through which silver can pass by volume
diffusion at the same rate as it does by boundary diffusion along a
boundary is:

Temperature (°C)    Width (cm)
200                 45
400                 $5 \times 10^{-2}$
600                 $1.3 \times 10^{-3}$
800                 $1.3 \times 10^{-4}$
960                 $3.6 \times 10^{-5}$

This width must be the minimum grain diameter if bulk (lattice)
diffusion is to dominate the diffusion transport of material.

GREENOUGH 30 studied the extension of silver wires under small
stresses. His measured strain rate of $7.61 \times 10^{-9}$ per second at
920°C agrees with that of $2.66 \times 10^{-9}$ per second calculated by
HERRING. This calculation assumes that mosaic boundaries cannot
function as sinks for diffusion currents and that diffusional viscosity
can only account for creep at very high temperatures and low stresses.
HERRING supposes that these conditions apply in sintering and that
grains may slide over each other tangentially as they grow normally to
their boundaries.

In single crystal specimens of silver GREENOUGH 30 found no measurable
strain, which indicates that plastic flow within grains makes no
contribution to creep of the wires. His results for crystals 0.1 cm
diameter tested for 282½ hours at 910°C were:

Specimen             Stress (dynes/cm^2)    Strain
Single Crystal 1     131,000                +0.005
Single Crystal 2     114,000                +0.001
Polycrystalline      126,000                +0.058

HERRING 36 has examined the effect of the linear dimensions of a
sintering system for different sintering mechanisms. Thus, if two
compacts contain particles, in one case of radius $R_1$, in the other of
$R_2$ where $R_2 = \lambda R_1$, and if after intervals $\Delta t_1$ and
$\Delta t_2$ the contacts have radii $(r_1)_1$ and $(r_1)_2$ where
$(r_1)_2$ equals $\lambda (r_1)_1$, HERRING sought to relate the times
$\Delta t_1$, $\Delta t_2$ with the mechanisms of sintering.

When considering a viscous flow process in a crystalline material it is
not correct to take the expression $P = 2\gamma/R$ as the pressure inside a
particle of radius $R$, for the stress will vary over the surface and
shearing stresses will occur in the interior. For viscous flow HERRING
finds $\Delta t_2 = \lambda\,\Delta t_1$. For evaporation and condensation
when sintering particles of radius $10^{-5}$ cm or more
$\Delta t_2 = \lambda^2 \Delta t_1$. Volume diffusion would give
$\Delta t_2 = \lambda^3 \Delta t_1$. With surface diffusion
$\Delta t_2 = \lambda^4 \Delta t_1$, if it is assumed that rate of
migration is proportional to the gradient of a chemical potential.
Experiments have not yet been made on this line. Great care
will be required to avoid surface contamination of a specimen and to see
that one crystal orientation does not dominate one sample.
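
HERRING'S scaling relations lend themselves to a small lookup: for a
change of scale λ the time to reach a geometrically similar state is
multiplied by λ raised to a power that depends on the mechanism. The
sketch below uses the exponents 1, 2, 3 and 4 for viscous flow,
evaporation and condensation, volume diffusion and surface diffusion
respectively; the dictionary and function names are hypothetical.

```python
SCALING_EXPONENT = {
    "viscous flow": 1,
    "evaporation and condensation": 2,
    "volume diffusion": 3,
    "surface diffusion": 4,
}


def time_ratio(mechanism, scale):
    """Ratio of sintering times when all lengths are multiplied by scale."""
    return scale ** SCALING_EXPONENT[mechanism]


for mechanism in SCALING_EXPONENT:
    print(f"{mechanism:30s} doubling the particle size multiplies "
          f"the time by {time_ratio(mechanism, 2.0):g}")
```

An experiment comparing two particle sizes at the same temperature would
therefore distinguish the mechanisms by the observed ratio of times.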

To explain the increase in average pore size of pressed copper
specimens which they observed on sintering RHINES, BIRCHENALL and
HUGHES 63 consider volume diffusion. They found that the concentration
of small pores decreased sharply with sintering time (Fig. 3) indicating
that the shrinkage of small pores is very high; on the other hand, the
largest pores, those of volume more than $10^{-9}$ cm$^3$, increased in
volume. There is nothing in any slip mechanism of sintering to explain the
growth of pores. The authors suppose it to be due to volume diffusion,
vacant lattice site concentrations being higher in the vicinity of small
pores than of larger ones, so setting up concentration gradients and
diffusion. This implies that rapid maintenance of equilibrium
concentrations occurs by the development or condensation of vacancies.
Diffusion of this type between pores and the external surface of a
specimen is suggested as the cause of overall shrinkage.

By assuming that the thermal energy of a crystal may be assigned to
sites according to the Boltzmann distribution one may calculate the
fraction, $f$, of sites which at any temperature have more energy than that
required to form a vacant site. This energy was taken as 39,700 cal/g
atom. This calculation gives the following values of $f$:

Temperature (°C)    f
700                 $1.2 \times 10^{-9}$
800                 $8.0 \times 10^{-9}$
900                 $3.9 \times 10^{-8}$
1000                $1.5 \times 10^{-7}$
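
These fractions can be reproduced from the Boltzmann factor
f = exp(−E'/RT) with the stated energy of 39,700 cal/g atom. The sketch
below assumes a gas constant of 1.987 cal/(mol K) and converts from
degrees Centigrade by adding 273.15; the function name is hypothetical.

```python
import math

R_CAL = 1.987            # gas constant in cal/(mol K)
E_VACANCY = 39_700.0     # cal/g atom, as taken in the text


def vacancy_fraction(temp_c, energy=E_VACANCY):
    """Fraction of sites with energy above that needed to form a vacancy."""
    return math.exp(-energy / (R_CAL * (temp_c + 273.15)))


for temp_c in (700.0, 800.0, 900.0, 1000.0):
    print(f"{temp_c:6.0f} deg C   f = {vacancy_fraction(temp_c):.1e}")
```

The computed values agree with those tabulated to within a few per cent,
which also fixes the powers of ten for the two lower temperatures.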



The Kelvin equation is taken as giving concentrations of vacant sites 
near curved surfaces. Using these values, RHINES, BIRCHENALL and 
HUGHES estimate the size of pore which will neither grow nor shrink in 
a rather idealized model. In this each pore is supposed to have nine 
neighbours; it gains vacant sites from smaller pores and loses them to
larger ones or to the free surfaces. The proportion of sites leaving one 
size of pore which reaches pores of any other size depends upon their 
area of cross section. 

The results agree quite well with observation for long sintering times 
at high temperatures. It is suggested that quite large pores will never 
be removed by heat alone. Another effect forecast is that a non- 
wetted inclusion above a minimum size ought to provide a surface for 
vacant sites to discharge at, and so come to occupy a pore larger than 
itself. The shape and size of a specimen should also affect sintering 
rate ; it seems improbable that this is so. 

RHINES, BIRCHENALL and HUGHES think that evaporation and
condensation could provide a rapid enough change of pore shape; at
1000°C they estimate a rate of condensation at sharp junction angles
of about $10^{-11}$ g/sec; in a typical case of sintering about $10^{-9}$ g
of material would flow into the junction during the rounding of pores.

It becomes gradually more clear that under properly controlled 
circumstances the mechanism of sintering may be dominated by one or 
other of different processes, particularly by viscous flow or by diffusion. 
There is still uncertainty about what factors are most important in 
selecting a mechanism, and so this cannot yet be set up from first 
principles and conditions for sintering then forecast. Moreover, many 
simple points of the course of sintering are yet unsettled ; whether, for 
instance, pores do in general become spherical rapidly or only do so 
much more slowly than the shrinkage changes. The general course 
of sintering is still uncertain; it is known that much porosity is 
eliminated before pores are isolated, that small pores are removed 
before larger ones which may even grow; that the rounding of 
pores to spherical shape is of secondary importance ; that grain 
boundaries running into pores may be important in their shrinkage ; 
that conditions which favour vacant site formation make sintering 
quicker but that transformation temperatures are themselves of small 
importance ; it is not clear whether porosity can be wholly removed by 
sintering alone. It would be valuable to have such points decided more 
certainly. 

The author is indebted to his colleague Mr. F. O. JONES for frequent 
valuable discussions in connection with sintering ; he wishes to thank 
Dr. T. E. ALLIBONE, F.R.S., for permission to publish this article. 

NOTATION 

A Point of junction between the original surface of a sintering particle and 
of the surface of material which has moved during sintering (Figs. 6 
and 7). 

a Interatomic distance. 

B Proportionality constant in equation (4). 
C Concentration of vacant lattice sites. 

D_v, D_s and D_b Diffusion coefficients for volume, surface and grain-boundary 
diffusions. 

D' Value of a diffusion coefficient at 0° Abs. 
D_H Diffusion coefficient for lattice holes. 
E Activation energy. 
E' Energy to produce a vacant lattice site. 
f Fraction of lattice sites which are vacant. 

Energy. 

h Thickness of a flowing layer of matter. 

K Power of the junction lens radius, r_1, which should be proportional to 
sintering time, t, for any given mechanism. 
k Boltzmann constant. 
L Compacting pressure. 

l Short length at the junction between particles over which the drop in 
internal pressure occurs. 

M Rate of movement of matter. 

m Number of pores per unit volume. 

N Number of grains per unit length of a wire in which each grain fills the 

whole cross section of the wire. 

n Number of contacts made by a particle with its neighbours. 

P Pressure inside sintering particles. 

P Gas pressure. 

p_1, p_2 Vapour pressures over surfaces of different radii. 

q Total pore volume per gramme of material. 

R Particle or wire radius. 

R Gas constant. 

r Radius. 

r_1 Lens radius (Figs. 6 and 7). 

r_2 Radius of the surface of a lens (Figs. 6 and 7). 

T Absolute temperature. 

t Time. 

t a Relaxation time for condensation of atoms on a surface. 

U Index for hot-pressing relationship. 

u Radial velocity at the surface of a pore. 

V Lens volume in one gram of material. 

V Shrinkage in volume when one gram of material is sintered. 

v Volume of a single Clark and White 'lens.' 

W Weight under which a loaded wire neither extends nor contracts when 

heated. 

w Weight. 

X Radius of a pore. 

x cos θ. 

[Symbol illegible in source] Densification parameter. 
γ Surface energy. 
γ_B Grain boundary energy. 
δ Height of a lens (Fig. 7). 
η Coefficient of viscosity. 

η_∞ Ultimate coefficient of viscosity of a Bingham solid. 
θ Half the angle subtended at the centre of a particle by the junction with 
another particle. 
λ Herring's scale factor. 

ρ Density. 

τ_c Critical shear stress of a Bingham solid. 

φ_1, φ_2, φ_3 Angles. 

5 
THEORY OF DISLOCATIONS 

A. H. Cottrell 

INTRODUCTION 

THE fact that a second review of progress has become necessary, so 
soon after the first 1 (which will be referred to as T.D.I), shows that the 
theory of dislocations is being opened up rapidly. Some of this progress 
could have been anticipated from the ideas that were taking shape at the 
time the first review was being written but an important contribution 
has come from newer ideas, some of which have caused a radical change 
in outlook in certain parts of the theory. Judging from present-day 
ferment on topics such as the growth of crystals, work hardening, and 
the nature of slip bands, this pace of progress will be held for at least the 
next five years. 

When the first review was written the broad position was as follows. 
Arguments about the impossibility of the simultaneous movement of 
all the atoms in a slip plane and about the low yield strengths of crystals 
had convinced most people that slip takes place by the movement of 
dislocations. Simple types of dislocations, the edge and the screw, had 
been described and the newly discovered phenomenon of polygonization 
had strengthened belief in at least the first of these. The elastic proper- 
ties of dislocations were well understood and a detailed mathematical 
theory of their stress fields, strain energies, flexibility, forces of 
interaction, and mobility in simple lattices, already existed. The dynamical 
properties of dislocations were mainly unknown, although it had been 
recognized that dislocations could not move faster than sound, and that 
near this speed they would have enough kinetic energy to create new 
dislocations from themselves. This dynamic creation of dislocations was 
thought to be the basic process in the growth of slip bands. The source 
of the original dislocations was still a mystery, however. 

Age-hardening had already been discussed in detail from the point 
of view of dislocations and a start had been made towards explaining 
the yield point of iron in terms of the anchoring of dislocations by 
segregated impurity atoms. The theory had also been applied to the 
analysis of the structures of grain boundaries, but not to the strengthen- 
ing effect which they have. Several attacks on the work hardening 
problem had been made, but these had only penetrated far enough to 
show how formidable it really is. So long as it remained unsolved there 
could be little progress, apart from preliminary skirmishes, with the 
closely related problems of recrystallization, creep, fatigue, and 
ductile fracture. 

The most exciting of the new developments has been in crystal growth. 
Difficulties in the classical theory led to the idea that the growth of real 
crystals is catalysed at places where dislocation lines meet their surfaces. 
Many crystals contain dislocations because they could not have grown 
otherwise. Spectacular evidence supporting these ideas has come with 
the actual observation of the "growth spirals" predicted by the theory ; 
it is now possible to locate, and even "see," dislocations in certain 
crystals. 

An important new idea is the "Frank-Read source." This shows that 
certain geometrical arrangements of dislocation lines are able to create 
from themselves large numbers of new dislocations by a process not 
involving kinetic energy. This idea of static creation is not only oppor- 
tune, because doubts have been raised about whether dislocations 
can move fast enough for dynamic creation ; it is also something long 
needed for understanding slip bands, deformation twins, and work 
hardening. Another important idea is the creation and absorption of 
vacant atomic sites at dislocations. Things that may depend on this 
are the Kirkendall effect in diffusion, polygonization, work hardening, 
and annealing. 

Various advances have been made in the elastic theory of dislocations, 
including the distribution of dislocations in slip bands, the energies of 
small-angle crystal boundaries, the effects of interatomic forces on the 
widths and mobilities of dislocations, and numerous effects concerning 
the interactions of dislocations with each other, with grain boundaries, 
and with other kinds of elastic singularities. The forms of dislocations 
in various crystal lattices have begun to be examined, and this has led 
to the recognition of various "imperfect" dislocations that line the 
boundaries of sheets of stacking fault in the lattices. These imperfect 
dislocations are believed to be important in determining slip planes, in 
deformation twinning, and in work hardening. 

The theory of the yield point and strain ageing has been elaborated 
and experimental support gained for some of its predictions. Problems 
concerning the obstruction of slip at grain boundaries have been clarified 
and the attack on work hardening has continued. Observations on 
cross-slip, multiple slip, deformation bands, and the fine structure of 
slip bands, have suggested new ways of looking at the work hardening 
problem. In particular, interactions of dislocations other than the 
classical elastic ones have attracted some attention, for example, 
parallel dislocations on intersecting slip planes may sometimes run 
together to form immobile dislocations. Although the work hardening 
problem is not yet solved, or the problems of recrystallization, creep 
etc. that are related to it, several new advances have been made and 
we can at least now say that dislocations seem to have that versatility 
in their properties that the complexities of these processes demand. 

A satisfying recent trend in the theory as a whole is that it is be- 
coming more quantitative, i.e. making quantitative predictions that 
can be tested experimentally. Two examples of this are the dependence 
on orientation of the energy of small-angle boundaries, and the time 
law of strain-ageing in iron. 

THE GROWTH OF IMPERFECT CRYSTALS FROM VAPOURS AND 

SOLUTIONS 

The driving force that makes a crystal grow from its vapour is deter- 
mined by the ratio of the pressure of the vapour to the vapour pressure 
of the solid. Growth from a liquid solution similarly depends on the 
degree of supersaturation, while growth from a melt depends on the 
degree of undercooling. 

The process by which a perfect crystal should grow has been analysed 
in detail in the classical theory of GIBBS, VOLMER, BECKER and 
DORING, and others (reviewed in references 2, 3, 4). Although the 
lattice is perfect, the crystal may have imperfect faces which consist of 
terraces of steps, each step being the edge of a single sheet of atoms or 
molecules which covers part of the face. Growth can occur easily on 
these imperfect faces, even when the degree of supersaturation is small, 
because atoms can deposit in the re-entrant corners provided by the 
steps and become firmly attached to the crystal by bonding with two or 
more of its atoms. This state of affairs cannot continue indefinitely, 
however. As the steps advance by the repeated attachment of new 
atoms they leave behind themselves smooth close-packed crystal faces 
on which there are no projections against which later atoms could lodge 
themselves. The crystal cannot grow further unless atoms condense on 
these smooth faces and assemble together to make a nucleus for the 
next layer. This is difficult because isolated atoms or small groups of 
them can be thrown off easily by thermal agitation, and the chance of 
a sufficient number of them coming together to make a permanent 
attachment to the crystal is small if the vapour is only mildly super- 
saturated. This conclusion can be verified by mathematical analysis 
of the nucleation process 2, 3, 4 and it has been shown in this way that 
perfect crystals could only grow at the rates observed with real crystals 
if the degree of supersaturation exceeded about 50 per cent. 

Most experiments contradict this result. For example, VOLMER 
and SCHULTZE, 5 working on the growth of inorganic and organic 
crystals from vapours, showed that the rate of growth is practically 
as fast as the rate at which molecules strike the surface from the 
vapour, even when the supersaturation is only about 1 per cent. Moreover, 
the rate of growth is directly proportional to the degree of 
supersaturation, whereas the theory predicts a spectacular exponential 
dependence. 

FRANK 3, 4 realized that neither the theory nor the experiments were 
wrong, but that they applied to different crystals. The crystals which 
grow so easily in practice are imperfect. In particular they contain 
dislocations. The importance of dislocations in crystal growth can be 
seen as follows. What is needed for continual and rapid growth is an 
imperfection of the crystal faces which ensures that growth steps are 
always on them, no matter how much they grow. As FRANK pointed 
out, this condition is ensured if screw dislocations end at these faces (or 
edge dislocations if these approach the surface obliquely so that there is 
a component of slip standing out of the surface). Fig. 1 shows a 
crystal face on which there is a point of emergence B for a screw 
dislocation AB in the crystal. From the definition of a screw dislocation 
(T.D.I, page 79) the direction of slip is along AB, so that a step BC is 
formed on the face running outwards from the point of emergence. 
Growth can occur by condensation along the step, and a new layer is 
added as the step rotates about its point of emergence. 

Fig. 1. A screw dislocation AB which ends at a point B on a crystal face 

The essential point is that this step never disappears. Because of its 
special topological property of ending in the middle of the face, which is 
a consequence of the geometry of the dislocation, the step is always 
present no matter how much the crystal continues to grow. It does not 
retain the simple form shown in the diagram, of course. Since that 
part of the step near to the point of emergence requires the addition of 
only a few atoms to make one revolution, whereas outlying parts need 
large numbers of atoms, the centre rotates faster than the outside, and 
the step winds up into a "growth spiral," as in Fig. 2. 

FRANK'S theory also considers growth from several points of emer- 
gence on the same face. The rate of growth is, over a wide range, not 
very sensitive to the density of dislocations which emerge on the face. 
An important case is that where two dislocations of opposite signs 
emerge and are joined by a common growth step. This step generates 
not a continuous spiral but a series of concentric growth rings, as in 
Fig. 3. 

Fig. 2. A single growth spiral 

Fig. 3. Stages in the formation of a growth ring from a step joined to 
two points of emergence, A and B 

Striking confirmation of this theory has been provided by the actual 
observation of growth spirals and rings, showing all the features 
expected from the theory, on the faces of various crystals. Fig. 4 
shows the original example by GRIFFIN 6 on a crystal of beryl. This has 
been the fore-runner of many others observed on various inorganic and 
organic crystals. 7 By using electron microscopy, multiple-beam inter- 
ferometry, and other optical methods, it has been proved that the 
steps are equal in height to the lattice constant of the crystal, or are 
small multiples of that height. 

Because of their small lattice constants, and for other reasons, metals 
are not the easiest materials for showing growth steps. However, by 
depositing large molecules on them, the steps can be made visible, and 
this method has been used by FORTY 8 to show growth spirals in a 
magnesium crystal grown from a vapour. Fig. 5 is an example ; the 
height of the step in this case is probably equal to the c axis of the 
crystal. 

FRANK'S theory explains the growth of imperfect crystals, not their 
nucleation, and separate considerations are needed to decide how the 
dislocations originate in the imperfect nuclei. FRANK suggests various 
possibilities for this, e.g. (a) nucleation in regions of high supersaturation, 
(b) plastic buckling of very thin and fragile nuclei, (c) nucleation on an 
imperfect substrate, (d) buckling under large strains created by a 
gradient of composition, and (e) the collapse of sheet-like cavities formed 
by the aggregation of vacant atomic sites. From a practical standpoint the 
origin of these imperfect nuclei is not important ; what matters is that 
they grow in preference to perfect ones. 




THE STATIC CREATION OF DISLOCATIONS 

In 1950, at a conference in Pittsburgh, FRANK and READ 9 pointed out that 
slip bands might grow by a dislocation mechanism similar in 
some ways to that for the growth of crystals. The dislocations concerned 
in crystal growth need not lie in slip planes. In fact, as FRANK has 
suggested, they are likely to form a three-dimensional network structure, as 
shown in Fig. 6. In such cases we can expect dislocation lines which for 
part of their length lie in a slip plane, and then at some point turn out of 
this plane along an intersecting direction, or join other dislocations 
lying in other planes. A simple example of one of these Frank-Read 
sources is shown in Fig. 7. Here an edge dislocation ABC forms the 
internal boundary of a quarter-plane ABCD of atoms. Suppose that 
the dislocation line BC can move freely in its slip plane CEF and that 
the part AB is for some reason unable to move. Then B is an internal 
point of emergence and the dislocation BC must remain attached to it 
as it glides in its slip plane. The dislocation rotates about this point and 
winds up into a rotating spiral, as shown, which produces a translational 
slip of one atom spacing each time it makes one revolution. 

Fig. 6. Network structure made from dislocation lines 

Fig. 7. The formation of a slip band by the rotation of a 
dislocation line BC about an internal point of emergence B 

Pieces of dislocation with two internal points of emergence, such as 
those in Fig. 6, can generate slip bands by making dislocation rings. 
Fig. 3 illustrates this process if the lines in these diagrams are thought 
of as dislocations. When the scale l of the network structure is small 
compared with dimensions of the crystal most of the Frank-Read 
sources will be of this type. Only those within a distance l from the 
surface will be of the single-ended type shown in Fig. 7. 

The stress needed to start a source working is important. Consider a 
double-ended source, such as those in Fig. 6. The dislocation has a 
line tension (T.D.I, page 117). This causes it to contract in length until 
it is a straight line joining its two points of emergence. Before the 
source can be set in action this line has first to be bent into a loop of 
radius l/2 (Fig. 3). The applied stress σ needed to do this (T.D.I, 
equations 28 and 29) is about 

$$\mu b / l, \qquad (1)$$

where μ is the shear modulus and b is the strength of the dislocation, 
i.e. the length of the slip displacement it produces.* Since b is normally 
one atomic spacing this stress is about the yield stress of a single 
crystal when l is 10⁴ atomic spacings. 

FISHER 10 has pointed out that single-ended sources of the same length 
begin to work at only one-half of the above stress. This is because the 
other end of such a dislocation is not attached to a fixed point, but to 
some point along the line where the slip plane meets the surface of the 
crystal. Provided that the dislocation meets this line perpendicularly, 
in order to preserve an equilibrium of forces here, this external point of 
emergence is free to move along the line. As a result it is only necessary 
to bend the dislocation into a loop of radius l, not l/2, to set it working. 
FISHER 10 has used this idea to explain an observation of BROWN and 
HONEYCOMBE 11 that, on a purely electropolished surface of aluminium, 
the slip appears in the form of innumerable fine markings, instead of 
the much coarser slip bands some 10⁻³ to 10⁻⁴ cm apart, which are typical 
of strained crystals with mechanically worked surfaces. FISHER argues 
that when the surface is unworked many more Frank-Read sources are 
active at the surface than in the interior, since for a given length of 
source the surface ones become active at only one-half of the stress 
needed for the interior ones. On the other hand, when the surface has 
been mechanically worked these surface sources are less active because 
of the cold-working strains there. 

* To conform to recent practice, we now use μ and b for those quantities symbolized 
by G and λ, respectively, in T.D.I. 
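The orders of magnitude in this argument are easy to check numerically. The following is a minimal sketch, not part of the original text: the function name `source_stress_over_mu` and its arguments are illustrative assumptions, lengths are measured in units of the slip distance b, and the factor of one-half for single-ended sources is FISHER's estimate as quoted above.

```python
def source_stress_over_mu(length_in_spacings, single_ended=False):
    """Return sigma/mu needed to start a Frank-Read source, from equation (1).

    length_in_spacings is the source length l divided by the slip distance b
    (a hypothetical parameter name). A single-ended source is taken to need
    half the stress of a double-ended one of the same length.
    """
    if length_in_spacings <= 0:
        raise ValueError("source length must be positive")
    ratio = 1.0 / length_in_spacings  # sigma/mu = b/l
    return 0.5 * ratio if single_ended else ratio


# Compare interior (double-ended) and surface (single-ended) sources.
for l_over_b in (1e3, 1e4, 1e5):
    interior = source_stress_over_mu(l_over_b)
    surface = source_stress_over_mu(l_over_b, single_ended=True)
    print(f"l = {l_over_b:.0e} b: interior {interior:.1e} mu, surface {surface:.1e} mu")
```

With l equal to 10⁴ spacings the interior value is 10⁻⁴ μ, so that for a given applied stress the surface sources that can operate are only half as long as the shortest active interior ones.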

There can be little doubt that in practice a slip band can be made 
very quickly, in a fraction of a second. Can a single Frank-Read source 
make a slip band as fast as this ? The critical thing is the radius of 
curvature of the central loop of the rotating dislocation spiral, which is 
about μb/σ. This central loop makes about v/{2π(μb/σ)} revolutions per 
second, where v is the linear velocity of the dislocation line in the slip 
plane. The time needed to make a typical slip band, e.g. 1000 revolutions, 
is of order 10³ (2πμb/vσ) seconds. For v we shall assume the value 
10⁴ cm sec⁻¹, which is about 0.1 that of sound (see next section). 
Taking also σ/μ = 10⁻⁴ and b = 3 × 10⁻⁸ cm, the time to make the 
slip band is less than 0.0002 sec. 
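The arithmetic behind this estimate can be reproduced in a few lines. This is a sketch only: the function `slip_band_time` and its parameter names are invented for illustration, and the default values are simply the figures assumed in the paragraph above.

```python
import math


def slip_band_time(revolutions=1000, sigma_over_mu=1e-4,
                   b_cm=3e-8, v_cm_per_s=1e4):
    """Time in seconds for one source to complete the given revolutions."""
    loop_radius = b_cm / sigma_over_mu            # mu*b/sigma, in cm
    revs_per_second = v_cm_per_s / (2.0 * math.pi * loop_radius)
    return revolutions / revs_per_second


t = slip_band_time()
print(f"time to form the slip band: {t:.2e} sec")
```

The result is a little under 2 × 10⁻⁴ sec, in line with the bound quoted in the text; because the time varies as 1/v, a dislocation ten times slower would still form the band in about two thousandths of a second.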

THE DYNAMIC PROPERTIES OF DISLOCATIONS 

The above theory of the static creation of dislocations raises the question 
of the present status of the earlier dynamic theory (T.D.I, page 100), 
and this leads to the more general question of the dynamics of moving 
dislocations. 

The original conclusion of FRENKEL and KONTOROVA (T.D.I, page 99) 
that the energy of a dislocation rises to infinity with its speed according 
to the "relativistic" formula 

$$U = U_0 / (1 - v^2/c^2)^{1/2}, \qquad (2)$$

where U_0 is the rest energy of the dislocation and c is the speed of 
sound in the medium, has now been confirmed by the analysis of 
dislocations in three-dimensional bodies. 12, 13, 14 A simple case is that 
of a screw dislocation lying along the z axis and moving along the x 
axis at a steady speed v. When v = 0 the elastic equation of equilibrium 
governing the strain field of the dislocation is 

$$\frac{\partial^2 w}{\partial x^2} + \frac{\partial^2 w}{\partial y^2} = 0, \qquad (3)$$

where w is the component of displacement along the z axis, the other 
two components being zero. For a simple screw dislocation of strength 
b this has the solution 

$$w = b(\theta/2\pi) = (b/2\pi) \tan^{-1}(y/x), \qquad (4)$$

where θ is the angle measured about the screw axis from some arbitrary 
reference line. 

When this dislocation is moving the equation of equilibrium is found 
by equating the force acting on any volume element due to the stress 
field to the force of inertia due to the acceleration of the matter in this 
volume element. Equation (3) then becomes 

$$\frac{\partial^2 w}{\partial x^2} + \frac{\partial^2 w}{\partial y^2} = \frac{1}{c^2}\frac{\partial^2 w}{\partial t^2}, \qquad (5)$$

where t is the time and 

$$c = (\mu/\rho)^{1/2} \qquad (6)$$

is the speed of transverse sound or shear waves in the medium, ρ being 
its density. We need a solution that reduces to equation (4) when 
v = 0, and moves along the x axis with a constant form and constant 
speed v. Equation (5) is a wave equation and it is known from general 
experience with these in electrodynamics that steady propagational 
solutions are formed by making a Lorentz transformation. In the present 
case this means substituting the variable 

$$x_1 = (x - vt)/(1 - v^2/c^2)^{1/2} \qquad (7)$$

in the equation. Noting that for a steady propagational solution the 
relation ∂²/∂t² = v²(∂²/∂x²) holds, the equation then becomes 

$$\frac{\partial^2 w}{\partial x_1^2} + \frac{\partial^2 w}{\partial y^2} = 0, \qquad (8)$$

which is identical in form with equation (3). Evidently the required 
solution is simply 

$$w = (b/2\pi) \tan^{-1}(y/x_1). \qquad (9)$$
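A quick numerical check can make this result concrete. The sketch below is an illustration added under assumptions, not the author's procedure: it works in hypothetical units with b = 1 and c = 1, picks v = 0.5c arbitrarily, and uses central finite differences (the helper names `screw_displacement`, `second_difference` and `wave_residual` are invented for the example) to confirm that equation (9) satisfies equation (5) at a sample point.

```python
import math


def screw_displacement(x, y, t, b=1.0, v=0.5, c=1.0):
    """Displacement w of equation (9) for a screw dislocation moving along x."""
    contraction = math.sqrt(1.0 - (v / c) ** 2)   # (1 - v^2/c^2)^(1/2)
    x1 = (x - v * t) / contraction                # equation (7)
    return (b / (2.0 * math.pi)) * math.atan2(y, x1)


def second_difference(f, h):
    """Central-difference estimate of the second derivative of f at 0."""
    return (f(h) - 2.0 * f(0.0) + f(-h)) / h ** 2


def wave_residual(x, y, t, v=0.5, c=1.0, h=1e-3):
    """Left side minus right side of equation (5), estimated numerically."""
    w_xx = second_difference(lambda s: screw_displacement(x + s, y, t, v=v, c=c), h)
    w_yy = second_difference(lambda s: screw_displacement(x, y + s, t, v=v, c=c), h)
    w_tt = second_difference(lambda s: screw_displacement(x, y, t + s, v=v, c=c), h)
    return w_xx + w_yy - w_tt / c ** 2


# Sample point with y > 0, away from the dislocation line and the branch cut.
residual = wave_residual(1.0, 0.7, 0.0)
print(f"residual of equation (5): {residual:.2e}")
```

The residual should vanish to within the truncation error of the difference scheme, while setting v = 0 recovers the stationary field of equation (4).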

The field of the moving dislocation is the same as that of the stationary 
one except that x_1 replaces x. The factor (x − vt) in x_1 accounts for the 
steady propagation of the dislocation while the Lorentz contraction 
factor (1 − v²/c²)^(1/2) shows that the range of the stress field along the 
axis of motion is reduced. The energy of the moving dislocation can be 
found from this result. We add the density of elastic energy 
½μ[(∂w/∂x)² + (∂w/∂y)²] to the density of kinetic energy ½ρ(∂w/∂t)² and 
a short calculation for ∂w/∂x, ∂w/∂y, and ∂w/∂t then leads to equation (2). 
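The short calculation can be sketched as follows. This is a reconstruction under stated assumptions rather than the author's own working: β, r_0, R and φ are symbols introduced here, with β = (1 − v²/c²)^(1/2), r_0 and R taken as inner and outer cut-off radii measured in the (x_1, y) plane, and x_1 = r cos φ, y = r sin φ.

```latex
% Assumed shorthand (not in the original): \beta = (1 - v^2/c^2)^{1/2};
% polar coordinates x_1 = r\cos\phi, y = r\sin\phi; cut-off radii r_0 and R.
\begin{align*}
\frac{\partial w}{\partial x} &= -\frac{b}{2\pi\beta}\,\frac{y}{x_1^2 + y^2}, \qquad
\frac{\partial w}{\partial y} = \frac{b}{2\pi}\,\frac{x_1}{x_1^2 + y^2}, \qquad
\frac{\partial w}{\partial t} = -v\,\frac{\partial w}{\partial x}, \\
\text{energy density} &= \tfrac{1}{2}\mu\left[\left(1 + \frac{v^2}{c^2}\right)
  \left(\frac{\partial w}{\partial x}\right)^2
  + \left(\frac{\partial w}{\partial y}\right)^2\right]
  \quad (\text{using } \rho = \mu/c^2), \\
U &= \int\!\!\int (\text{energy density})\,\beta\,dx_1\,dy
   = \frac{\mu b^2}{8\pi^2}\,\beta \int_{r_0}^{R}\frac{dr}{r}\int_0^{2\pi}
     \left[\frac{1 + v^2/c^2}{\beta^2}\sin^2\phi + \cos^2\phi\right] d\phi \\
  &= \frac{\mu b^2}{8\pi^2}\,\beta\,\pi\,
     \frac{(1 + v^2/c^2) + (1 - v^2/c^2)}{\beta^2}\,\ln\frac{R}{r_0}
   = \frac{\mu b^2}{4\pi}\,\ln\frac{R}{r_0}\,\frac{1}{\beta}
   = \frac{U_0}{(1 - v^2/c^2)^{1/2}} .
\end{align*}
```

Here U_0 = (μb²/4π) ln(R/r_0) is the value obtained when v = 0, so the speed enters the energy per unit length only through the factor 1/β.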
This equation shows that a dislocation cannot reach the speed of 
sound because its energy would become infinite. An important question 
is whether it can reach a speed about 80 per cent that of sound, for then 
its behaviour would become significantly different from that of a slow 
dislocation ; for example, it would have enough excess energy to create 
new dislocations. The practical speed limit is reached when the energy 
gained by the moving dislocation from the applied stress is balanced by 
that lost from the dislocation by various dissipative processes. Some 
of these processes have been considered by SEITZ, 15 ESHELBY, 16 
LEIBFRIED, 17 and NABARRO. 18 The main ones are as follows : 

Thermoelastic Damping 

In an elastic solid a change in hydrostatic pressure causes a change 
in temperature. When an edge dislocation moves the hydrostatic stress 
field caused by the dislocation also moves and so the temperature at any 
fixed point is continually changed. Heat flows down the temperature 
gradients set up in this moving field. This is a thermodynamically 
irreversible process and, therefore, dissipates the mechanical energy of 
the system. ESHELBY 16 has shown that thermoelastic damping is 
unlikely to prevent the dislocation from reaching "relativistic" speeds. 

Radiation Damping 

This occurs when elastic waves are transmitted outwards from the 
moving dislocation. Because of the atomic structure of the crystal and 
the breakdown of Hooke's law at large strains (T.D.I, page 84), the 
precise arrangement at the centre of a dislocation alters periodically 
as it moves from one lattice row to the next. The frequency with 
which it alters is v/b, where v is the speed and b the lattice spacing in the 
direction of motion. Elastic pulses are transmitted out to the sur- 
rounding crystal with this frequency and dissipate some of the energy 
of the dislocation. The mathematical analysis of these pulsations is 
difficult, and their effect is correspondingly uncertain, but NABARRO 19a 
believes that the dissipation is large enough to prevent relativistic 
speeds being reached unless the applied stress is at least one order of 
magnitude greater than the yield stress of soft crystals. SEITZ 15 has 
reached the same conclusion by a different line of argument. 
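To give a feeling for the frequency v/b involved, the sketch below evaluates it for a few speeds. It is an illustration only: `pulse_frequency` is an invented helper, the spacing 3 × 10⁻⁸ cm is the value of b used in the previous section, and the sound speed of 10⁵ cm/sec is an assumption inferred from the earlier statement that 10⁴ cm/sec is about 0.1 that of sound.

```python
def pulse_frequency(speed_cm_per_s, spacing_cm=3e-8):
    """Frequency v/b, per second, at which the dislocation centre alters."""
    if spacing_cm <= 0:
        raise ValueError("lattice spacing must be positive")
    return speed_cm_per_s / spacing_cm


ASSUMED_SOUND_SPEED = 1e5  # cm/sec; an assumption, see the lead-in above

for fraction in (0.01, 0.1, 0.8):
    frequency = pulse_frequency(fraction * ASSUMED_SOUND_SPEED)
    print(f"v = {fraction:.2f} c: about {frequency:.1e} pulses per sec")
```

At one-tenth of the assumed speed of sound the pulses are emitted a few times 10¹¹ per second.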

Scattering of Sound Waves 

When a sound wave runs into a dislocation both may be disturbed. 18 
The inhomogeneous strain field at the centre of the dislocation refracts 
the sound wave through an angle. In turn the sound wave imposes an 
oscillating stress field which may make the dislocation oscillate in its 
slip plane and radiate out elastic waves. The effect of these is that the 
impinging sound wave exerts a force on the dislocation. When the 
dislocation is moving it runs into more sound waves on the leading 
side than on the trailing one and so is subjected to more forces pushing 
it back than forward. By assuming that a moving dislocation scatters 
sound waves in the same way that a line of atoms would, LEIBFRIED 17 
concludes that the damping due to the oscillation of the dislocation is 
so severe that fast speeds are quite impossible at the yield stresses of 
soft crystals. However, NABARRO IS& has pointed out that Leibfried's 
argument does not allow for the fact that the scattering power is 
proportional to the wavelength of the impinging waves. Those waves 
approaching the dislocation from behind have an increased effective 
wavelength, due to the Doppler effect, and are, therefore, scattered 
more completely than those approaching from the front. In fact, this 
difference in scattering power exactly cancels the difference in the 
number of sound waves met by the dislocation on the two sides, so that 
Leibfried's effect does not exist, at least to the first order in v/c. 

We see that, although no firm conclusion has been reached, opinion 
is generally against the idea that dislocations can reach relativistic 
speeds, at low stresses at least. This opinion is supported by an experiment 
of LEIBFRIED, 20 who studied the extension of loaded aluminium 
crystals with an extremely sensitive extensometer and could not detect 
any of the sharp sudden increments of strain that would be expected if 
slip bands were forming from dislocations moving at relativistic speeds. 

Nevertheless, various suggestions have been made recently for 
dislocation processes involving relativistic speeds. OROWAN 21 has 
suggested that the unidirectional contraction in the stress field of a fast 
dislocation will cause it to turn out of its original slip plane, and he has 
used this to explain a spread of slip from one plane to another. HOLDEN 22 
has suggested that, at the upper yield point in iron, a dislocation that 
has broken free from its solute atoms (T.D.I, page 97) reaches high 
speeds and uses its kinetic energy to free nearby anchored dislocations. 

MOTT 23 and FISHER, HART and PRY 24 have used the idea of dynamic 
creation of dislocations at Frank-Read sources to explain the characteristic 
size of slip bands. They both consider double-ended sources and 
suggest that the stress μb/l is necessary only to start the source working. 
The first loop of a sequence formed at the source must grow from a 
standing start and needs this stress, but all loops formed immediately 
afterwards in the sequence have a running start and do not need it. The 
dislocation line bridging the source has kinetic energy as it moves 
through its position of minimum length and, if dissipation is small, this 
energy is enough to carry the dislocation forward to the critical semi- 
circular shape beyond which it can expand into the slip plane once more. 
If this is true it means that once a source starts working it continues to 
do so until the stress round it has been reduced temporarily by an 
amount μb/l. This fall in stress is considered to occur as a result of the 
"back-stress" set up from the dislocations themselves created by the 
source, either when they become piled up against some obstacle (such 
as a deformation band) in the crystal, 23 or while they are still advancing 
out into the slip plane. 24 In both cases several hundred dislocations 
have to be created before the stress at the source falls sufficiently to 
stop it working, and this is held to account for the slip "avalanches" 
that the observations of HEIDENREICH and SHOCKLEY, 25 and BROWN 26 
seem to require.* 

* See, however, the recent work of H. WILSDORF and D. KUHLMANN-WILSDORF, 
Z. angew. Phys., 1952, in press. 

THE CHARACTERISTIC DISLOCATIONS OF CRYSTALS 

The most important feature of a dislocation is its Burgers vector. This 
is the vector defining the movement of the material on one side of a slip 
plane, relative to that on the other side, when the dislocation glides 
through this plane. A more sophisticated definition has been given by 
FRANK. 27 We shall now consider what conditions have to be satisfied 
by the Burgers vectors of dislocations in various crystal structures. 
Dislocations with such vectors will be called the characteristic disloca- 
tions of the crystals. 

The essential condition is that the slip displacement which the 
Burgers vector represents must be such as to translate the atoms from 
initial positions to final positions where they are mechanically stable, 
since mechanically unstable configurations cannot exist. There are two 
kinds of such allowed translations, those that preserve the lattice 
(identity translations) and those that generate a new configuration, such 
as a twinned lattice. Dislocations are said to be perfect or imperfect, 
respectively, according as their Burgers vectors give identity translations 
or new configurations. Imperfect dislocations lie along the 
boundaries of stacking faults in the lattice. 

Fig. 8. Unit vectors in the simple cubic lattice 

Dislocations with large Burgers vectors, even though these give 
allowed translations, may be unstable for another reason. The elastic 
energy of a dislocation is proportional to the square of its Burgers 
vector, so that the energy is reduced when large dislocations split up 
into smaller ones. For example, if the lattice constant is $b$ a perfect 
dislocation with strength $2b$ lowers its energy from $4b^2$ to $2b^2$ if it splits 
into two widely separated dislocations each of strength $b$. FRANK 28 
has generalized this into a simple rule. In any combination or dissocia- 
tion of dislocations the elastic energy is lowered when the sum of the 
squares of the Burgers vectors is smaller for the product dislocations 
than for the reacting ones. 

The smallest perfect dislocations are called unit dislocations. In a 
simple cubic lattice of spacing a the unit vectors are a[100], a[110], and 
a[111], as shown in Fig. 8. In this notation a vector a/n[hkl] is one with 
components ah/n, ak/n, and al/n, along the principal crystal axes. 
Dislocations with vectors a[100] have the lowest energies, but the 
dissociations 

$$a[111] \rightarrow a[110] + a[001] \rightarrow a[100] + a[010] + a[001] \qquad (10)$$

lead to no reduction in elastic energy, according to Frank's rule. 
However, they raise the configurational entropy, since there are more 
ways of distributing many dislocations of small strengths than few 
dislocations of large strengths through the crystal. 
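
Frank's rule lends itself to a short numerical check. The sketch below is illustrative only: the helper names `norm_sq` and `frank_energy_change` are hypothetical, Burgers vectors are written as integer components in units of the lattice spacing, and the energy of each dislocation is taken simply as the square of its Burgers vector, as the rule assumes. It shows that splitting a dislocation of strength 2b lowers the sum of squares, whereas each step of reaction (10) leaves it unchanged.

```python
from fractions import Fraction


def norm_sq(b):
    """Squared length of a Burgers vector given by its components."""
    return sum(Fraction(c) ** 2 for c in b)


def frank_energy_change(reactants, products):
    """Sum of |b|^2 over products minus sum over reactants.

    A negative value means the reaction is favoured by Frank's rule.
    Raises ValueError if the total Burgers vector is not conserved.
    """
    total_in = [sum(col) for col in zip(*reactants)]
    total_out = [sum(col) for col in zip(*products)]
    if total_in != total_out:
        raise ValueError("Burgers vector is not conserved")
    return sum(map(norm_sq, products)) - sum(map(norm_sq, reactants))


# Strength 2b splitting into two dislocations of strength b.
split_2b = frank_energy_change([(2, 0, 0)], [(1, 0, 0), (1, 0, 0)])
# The two steps of reaction (10) in the simple cubic lattice.
step_1 = frank_energy_change([(1, 1, 1)], [(1, 1, 0), (0, 0, 1)])
step_2 = frank_energy_change([(1, 1, 0)], [(1, 0, 0), (0, 1, 0)])

print(split_2b, step_1, step_2)
```

The zero change for both steps of reaction (10) is why the text appeals to configurational entropy rather than elastic energy to explain the preference for a[100] dislocations.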

A unit dislocation has lowest energy when its Burgers vector is 
parallel to a direction of closest packing in the lattice, which explains 
why this is the preferred direction of slip. The choice of slip plane is 





Fig. 9. Dissociation of a unit dislocation (a) into two 
partial dislocations (b) 

less easily made, both in theory and practice. According to NABARRO 19a 
this depends on two factors, both of which favour planes of wide 
spacings: (a) in a close-packed lattice a unit dislocation will dissociate 
into partial dislocations (see below) when it lies in a close-packed plane; 
this leads to a lowering of strain energy and so dislocations should collect 
preferentially in these planes; (b) the Peierls-Nabarro lattice force 
(T.D.I, page 90) resisting the movement of a dislocation is smallest 
when the Burgers vector is as small as possible and the spacing of the 
slip planes holding the dislocation is as big as possible. CHALMERS and 
MARTIUS 29 define a parameter $\beta$, which is the strength of the dislocation 
divided by the spacing of the slip planes, and point out that most 
observed slip systems can be predicted from the rule that the preferred 
one is that with the smallest $\beta$. 
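
The rule of smallest strength-to-spacing ratio can be illustrated for the face centred cubic lattice. The following is a minimal sketch, not the calculation of CHALMERS and MARTIUS: it assumes the unit dislocation of length a/√2 and the usual spacings of successive atomic planes in this lattice (a/√3 for {111}, a/2 for {100}, a/2√2 for {110}); the variable names are ours.

```python
import math

a = 1.0                      # lattice constant (arbitrary units)
b = a / math.sqrt(2)         # strength of the unit dislocation a/2<110>

# Assumed spacings between successive atomic planes in the fcc lattice.
spacing = {
    "{111}": a / math.sqrt(3),
    "{100}": a / 2,
    "{110}": a / (2 * math.sqrt(2)),
}

# Parameter of the text: dislocation strength divided by plane spacing.
ratio = {plane: b / d for plane, d in spacing.items()}
preferred = min(ratio, key=ratio.get)

for plane, value in ratio.items():
    print(plane, round(value, 3))
print("smallest ratio:", preferred)
```

Under these assumptions the {111} plane gives the smallest ratio, in line with the observed slip plane of this lattice.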

If the lattice is suitable a unit dislocation may dissociate into 
partial dislocations, which are imperfect dislocations with Burgers 
vectors less than one atom spacing in length. This can happen when the 
crystal has more than one mechanically stable configuration. Fig. 9 
illustrates this diagrammatically. Here a unit edge dislocation 
dissociates into two partial dislocations, A and B, which are joined by a 
strip of faulted material where the configuration is altered. The vector 
sum of the Burgers vectors of the partial dislocations is equal to the 
Burgers vector of the original dislocation. The question of how far the 
partial dislocations separate depends on the energy of the stacking 
fault between them. This energy is proportional to the area of slip 
plane covered by the fault and hence to the distance between the 
dislocations at its boundary. Thus if $e$ is the energy per unit area of the 
fault, the force per unit length pulling the dislocations together is also $e$. 
The repulsive force due to the elastic interaction of the dislocations is, 
by a generalization of equation (9) in T.D.I, $\mu(\mathbf{b}_A \cdot \mathbf{b}_B)/2\pi k r$, where 
$(\mathbf{b}_A \cdot \mathbf{b}_B)$ is the scalar product of the Burgers vectors and $r$ is the distance 
between the dislocations; $k = 1$ for screw dislocations, and $(1 - \nu)$ 
for edges, where $\nu$ is Poisson's ratio. The dislocations are in stable 
equilibrium at the distance 

$$r = \frac{\mu(\mathbf{b}_A \cdot \mathbf{b}_B)}{2\pi k e} \qquad (11)$$



where these two forces balance. In general this is only a few atom 
spacings. The two partial dislocations thus do not separate very far 
and to a distant observer appear like a single dislocation; they are 
referred to jointly as an extended dislocation. In crystals in which unit 
dislocations are able to extend themselves the plane of slip is more 
clearly defined than in others, because an extended dislocation has a 
unique plane, the plane of its stacking fault, in which it can move easily. 
We shall now consider the characteristic dislocations of some particu- 
lar crystal structures. 

Face Centred Cubic Crystals 

The shortest lattice vector is a/2[110], which joins an atom at a cube 
corner to a neighbour at a face centre, and defines the observed slip 
direction. Unit dislocations in {111} planes can lower their energies by 
splitting into two partial dislocations, the half-dislocations of 
HEIDENREICH and SHOCKLEY. 25 Consider slip on a (111) plane, as in Fig. 10. 
The vector $\mathbf{b}_1 = a/2[10\bar{1}]$ defines an observed slip direction. THOMPSON 
and MILLINGTON 30 and MATHEWSON 31 pointed out that, if atoms are 
regarded as balls rolling over one another, then one in a position of type 
B would roll most easily to a nearby position of type C. On an atomic 
scale of magnitude the atoms are expected to follow a zig-zag path 
B -> C -> B etc, slipping alternately along $[2\bar{1}\bar{1}]$ and $[11\bar{2}]$, with a 
mean slip direction $[10\bar{1}]$. 

A slip from B to C changes the order of stacking of the (111) layers. 
A perfect sequence . . . ABCABCABC . . . becomes the faulted 
sequence . . . ABCACABCA . . . which contains a stacking fault 
. . . CACA . . . consisting of a thin layer of hexagonal close packing. 
Such a fault preserves the typical close packing of nearest neighbours 
and so should have a low energy of formation. This is confirmed by 
the observations that these faults are formed freely in cobalt 32 and 
lithium 33 during phase transformations, and in certain copper-silicon 
alloys during cold working. 33 Moreover, the surface energy of {111} 
boundaries of annealing twins in copper, where the stacking sequence is 
very similar to that of a fault, is only 21 erg cm$^{-2}$ according to 
FULLMAN. 34 

HEIDENREICH and SHOCKLEY 25 proposed the dissociation reaction 

$$\frac{a}{2}[10\bar{1}] \rightarrow \frac{a}{6}[2\bar{1}\bar{1}] + \frac{a}{6}[11\bar{2}] \qquad (12)$$

for a unit dislocation in (111). This reaction is formally possible since 
the sum of the components of the Burgers vectors is the same for both 
reactant and product dislocations. Thus the right-hand side sums to 

$$\frac{a}{6}[2 + 1,\ \bar{1} + 1,\ \bar{1} + \bar{2}] = \frac{a}{6}[30\bar{3}] = \frac{a}{2}[10\bar{1}]. \qquad (13)$$

Fig. 10. Slip directions in a (111) plane 



It is also energetically possible since by Frank's rule the energy is 
lowered from $a^2/2$ to $a^2/3$ when it occurs. In Fig. 10 the reaction is 
represented by $\mathbf{b}_1 \rightarrow \mathbf{b}_2 + \mathbf{b}_3$. The original unit dislocation splits into 
two partials which move apart in the (111) plane and generate a stacking 
fault in the region between them. The equilibrium spacing is given by 
equation (11) and if we take $e \approx 20$ dyne cm$^{-1}$, $(\mathbf{b}_A \cdot \mathbf{b}_B) = a^2/12$, 
$\mu \approx 4 \times 10^{11}$ dyne cm$^{-2}$, and $a^2 = 10^{-15}$ cm$^2$, this spacing is about 10 atoms. 
A shear stress applied along $[10\bar{1}]$ drives both dislocations in the 
same direction, since they both contribute to a $[10\bar{1}]$ slip, but a stress 
applied along $[1\bar{2}1]$ could either separate them or close them together. 
The stress to separate them completely is large, about $10^{-2}\mu$, as is also 
the stress to force them together. Except in the presence of such 
stresses they always preserve their equilibrium spacing and move as a 
unit in the (111) plane. They cannot glide out of this plane, because 
their stacking fault prevents this. If they should be forced together, 
to make a unit $a/2[10\bar{1}]$ dislocation, and this happens to be a pure 
screw, it may then dissociate into two other partials, $a/6[21\bar{1}]$ and 
$a/6[1\bar{1}\bar{2}]$, lying in the $(1\bar{1}1)$ plane. This reaction, which would cause 
cross-slip, 35, 36, 37 might occur at places of high stress concentration or 
at points of emergence for the $a/2[10\bar{1}]$ dislocation, but apart from this 
deviations from slip on single (111) planes will be rare. 
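
The estimate of the equilibrium spacing quoted above can be reproduced as follows. This is a sketch under stated assumptions: equation (11) is taken in the form r = μ(b_A . b_B)/2πke, the two partials are taken as a/6[2,-1,-1] and a/6[1,1,-2], k is set to 1 (screw orientation), and the result is expressed in units of the nearest-neighbour distance a/√2. The function name is hypothetical.

```python
import math


def equilibrium_separation(mu, b_dot, fault_energy, k=1.0):
    """Spacing at which fault tension balances elastic repulsion (c.g.s. units)."""
    return mu * b_dot / (2 * math.pi * k * fault_energy)


a_sq = 1e-15                 # a^2 in cm^2, value quoted in the text
a = math.sqrt(a_sq)

# Partial Burgers vectors in units of a/6 (assumed signs of the components).
b_A = (2, -1, -1)
b_B = (1, 1, -2)
b_dot = (a_sq / 36) * sum(p * q for p, q in zip(b_A, b_B))   # equals a^2/12

r = equilibrium_separation(mu=4e11, b_dot=b_dot, fault_energy=20.0)
r_in_atoms = r / (a / math.sqrt(2))

print(f"r = {r:.2e} cm, about {r_in_atoms:.0f} atomic distances")
```

With k = 1 − ν for the edge orientation the spacing is somewhat larger, but it remains of the order of ten atoms.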

FRANK 27 has suggested another type of partial dislocation, the 
sessile dislocation, in the face centred cubic lattice. Suppose in Fig. 11 
that we are looking sideways at the edges of (111) planes. There is a 
fault in the centre where part of the central A layer is missing. 
The neighbouring B and C layers collapse towards each other in this 
region and fit together in correct close-packed order in the faulted 
sequence ABCBCA. 

Fig. 11. A sessile dislocation 

At the rim of this region an edge dislocation is formed with a Burgers 
vector $a/3[111]$. Its direction of glide is [111] but it cannot move this 
way because this would involve joining pieces of B layer to A layer etc, 
which would leave badly disarranged atoms on the surface delineated by 
its track. The dislocation is therefore unable to glide, as its name 
implies. Sessile dislocations can be made either by the aggregation of 
vacant sites into sheets on {111} planes, forming disc-shaped holes in 
these planes whose sides then collapse, as in Fig. 11, 38, 39 or by the 
dissociation of unit dislocations; for example, the reaction 

$$\frac{a}{2}[110] \rightarrow \frac{a}{3}[111] + \frac{a}{6}[11\bar{2}] \qquad (14)$$

produces a sessile dislocation and a half-dislocation, both lying in (111). 
Since face centred cubic crystals can glide on intersecting {111} 
planes reactions may occur between dislocations moving on these 
planes. One of the most interesting is that studied by LOMER. 40 He 
considered two unit dislocations, both parallel to $[1\bar{1}0]$. The first 
moves in (111) and has a Burgers vector $a/2[10\bar{1}]$, while the second moves 
in $(11\bar{1})$ and has a vector $a/2[011]$. By Frank's rule they attract each 
other and so run down their respective slip planes until they meet and 
coalesce at the $[1\bar{1}0]$ line where these planes cross. The reaction is 

$$\frac{a}{2}[10\bar{1}] + \frac{a}{2}[011] \rightarrow \frac{a}{2}[110] \qquad (15)$$

This new dislocation is an edge and cannot glide easily because its slip 
plane, i.e. the plane containing its line and its Burgers vector, is (001) 
and not a common plane for slip. 
COTTRELL 41 has pointed out that this dislocation may become 
completely immobile by dissociating, 

$$\frac{a}{2}[110] \rightarrow \frac{a}{6}[11\bar{2}] + \frac{a}{6}[112] + \frac{a}{6}[110]. \qquad (16)$$

The first two of these partial dislocations separate along their slip 
planes, (111) and $(11\bar{1})$ respectively, and produce stacking faults in 
these planes connecting them with the immobile a/6[110] dislocation in 
the site of the reaction, the line where these faults join. The mobile 
dislocations separate by a few atoms until a position is reached where 
the elastic repulsion on them is just balanced by the surface tension 
of their stacking faults. This triangular group of parallel dislocation 
lines obviously cannot glide. 
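
The reactions (12), (14), (15) and (16) can be checked for conservation of the Burgers vector and compared by Frank's rule. The sketch below is our own illustration: the signs of the components (the overbars) are assumptions chosen so that each partial lies in the slip plane named in the text, the helper names are hypothetical, and energies are sums of squared Burgers vectors in units of a².

```python
from fractions import Fraction as F


def scaled(factor, hkl):
    """Burgers vector factor*[hkl] as a tuple of exact fractions (units of a)."""
    return tuple(factor * h for h in hkl)


def vector_sum(vectors):
    return tuple(sum(c) for c in zip(*vectors))


def sum_of_squares(vectors):
    return sum(c * c for v in vectors for c in v)


def check(reactants, products):
    """Return (vector conserved?, energy before, energy after)."""
    conserved = vector_sum(reactants) == vector_sum(products)
    return conserved, sum_of_squares(reactants), sum_of_squares(products)


reactions = {
    "(12)": ([scaled(F(1, 2), (1, 0, -1))],
             [scaled(F(1, 6), (2, -1, -1)), scaled(F(1, 6), (1, 1, -2))]),
    "(14)": ([scaled(F(1, 2), (1, 1, 0))],
             [scaled(F(1, 3), (1, 1, 1)), scaled(F(1, 6), (1, 1, -2))]),
    "(15)": ([scaled(F(1, 2), (1, 0, -1)), scaled(F(1, 2), (0, 1, 1))],
             [scaled(F(1, 2), (1, 1, 0))]),
    "(16)": ([scaled(F(1, 2), (1, 1, 0))],
             [scaled(F(1, 6), (1, 1, -2)), scaled(F(1, 6), (1, 1, 2)),
              scaled(F(1, 6), (1, 1, 0))]),
}

results = {name: check(*sides) for name, sides in reactions.items()}
for name, (conserved, before, after) in results.items():
    print(name, conserved, before, "->", after)
```

On these assumptions reactions (12), (15) and (16) lower the sum of squares, while (14) leaves it unchanged, so that Frank's rule alone does not decide whether the sessile dislocation forms by dissociation.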

Close Packed Hexagonal Crystals 

The unit vector in the basal plane is of length a, the lattice constant in 
this plane, and follows a $\langle 2\bar{1}\bar{1}0 \rangle$ direction of closest packing, which is 
also the observed slip direction. Dislocations with such vectors can 
follow any path through the crystal but prefer to lie in basal planes 
where they can dissociate into half-dislocations, as in the face centred 
cubic lattice. Basal slip is common in this structure. The shortest 
lattice vector out of the basal plane is [0001] and of length c, the lattice 
constant along the hexagonal axis. Dislocations with this vector are not 
easily moved but are important in crystal growth. The growth spiral 
shown in Fig. 5 starts from such a dislocation. Dislocations with Burgers 
vectors along the hexagonal axis may also be important as pole 
dislocations in deformation twinning 42, 43 (see below). 

Body Centred Cubic Crystals 

The shortest lattice vector is a/2[111] which joins an atom at a cube 
corner to the one in the cube centre, and defines the observed slip 
direction. 

This lattice can be twinned by a shear of $1/\sqrt{2}$ or $\sqrt{2}$ in a $\langle 111 \rangle$ 
direction on a {112} plane, and twins of this kind can be produced by 
deformation in iron. This raises the possibility of a unit dislocation in 
a (112) plane dissociating as follows 

$$\frac{a}{2}[11\bar{1}] \rightarrow \frac{a}{3}[11\bar{1}] + \frac{a}{6}[11\bar{1}] \qquad (17)$$

to make two partial dislocations separated by a stacking fault consisting 
of a monolayer of twinned crystal. This dissociation is less certain than 
those discussed above because the original packing of nearest 
neighbours is not preserved on the twin interface. These partial dislocations 
have parallel Burgers vectors and cannot be separated by a 
homogeneous stress. Also, they are both of pure screw type when they lie 
along $[11\bar{1}]$, and since $[11\bar{1}]$ is the line along which (112), $(\bar{1}21)$ and 
$(2\bar{1}1)$ intersect, they can glide out of their original slip plane when in 
this orientation. Thus the limitation to a single slip plane is less severe 
than in the other crystal structures considered. 

A sessile dislocation may perhaps be formed by a dissociation of the 
type 

$$\frac{a}{2}[111] \rightarrow \frac{a}{3}[112] + \frac{a}{6}[11\bar{1}] \qquad (18)$$

when the reacting dislocation lies in (112). The dislocation $a/3[112]$ is 
pure edge and sessile. It forms the boundary of a stacking fault made 
by inserting two fault planes in the (112) stacking sequence. The 
sequence is . . . ABCDEFA . . ., and the fault is of the kind 
. . . ABCDCDEFA . . . . 
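
The body centred cubic reactions and the two twinning shears can be checked in the same way. This is a sketch under assumptions of our own: reaction (17) is taken as a/2[1,1,-1] → a/3[1,1,-1] + a/6[1,1,-1], reaction (18) as a/2[1,1,1] → a/3[1,1,2] + a/6[1,1,-1], and the spacing of {112} planes as a/√6. The helper names are hypothetical.

```python
import math
from fractions import Fraction as F


def scaled(factor, hkl):
    return tuple(factor * h for h in hkl)


def vector_sum(vectors):
    return tuple(sum(c) for c in zip(*vectors))


def sum_of_squares(vectors):
    return sum(c * c for v in vectors for c in v)


unit_17 = scaled(F(1, 2), (1, 1, -1))
parts_17 = [scaled(F(1, 3), (1, 1, -1)), scaled(F(1, 6), (1, 1, -1))]
unit_18 = scaled(F(1, 2), (1, 1, 1))
parts_18 = [scaled(F(1, 3), (1, 1, 2)), scaled(F(1, 6), (1, 1, -1))]

conserved_17 = vector_sum(parts_17) == unit_17
conserved_18 = vector_sum(parts_18) == unit_18
energy_17 = (sum_of_squares([unit_17]), sum_of_squares(parts_17))
energy_18 = (sum_of_squares([unit_18]), sum_of_squares(parts_18))

# Shear produced when one {112} layer is displaced by a partial of
# length a*sqrt(3)/6 or a*sqrt(3)/3 relative to the next (a = 1).
d_112 = 1 / math.sqrt(6)
shear_sixth = (math.sqrt(3) / 6) / d_112
shear_third = (math.sqrt(3) / 3) / d_112

print(conserved_17, energy_17, conserved_18, energy_18)
print(round(shear_sixth, 4), round(shear_third, 4))
```

The two shears come out as 1/√2 and √2, which matches the pair of twinning shears named in the text and is the main reason for assigning the a/6 and a/3 partials in this way.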

THE GROWTH OF DEFORMATION TWINS 

The idea that deformation twinning takes place by the continuous 
growth on an atomic scale of twinned material has often been suggested, 
and the general arguments for a dislocation mechanism of twinning are 
the same as for slip : it is unbelievable that the atoms all move simul- 
taneously, and the stress for twinning is far below the theoretical shear 
strength of a perfect lattice. Examples of deformation twinning are 
known which consist either of simple shears on the twinning plane 
(body centred cubic iron) or of shears accompanied by localized 
re-arrangements ("shuffles") of the atoms (hexagonal metal crystals). 
The shears can be produced directly by partial dislocations, but not the 
shuffles; it is presumed that these follow the shears automatically, 
since the structure will generally be mechanically unstable until the 
twinning process is completed. 

The main problem is to understand how twinning develops homo- 
geneously through successive planes. This seems to require either a 
twinning dislocation on every plane, or the motion of a single dislocation 
successively from plane to plane. COTTRELL and BILBY 44 have recently 
suggested a mechanism, based on that of FRANK and READ for slip, 
whereby successive twinning planes are swept consecutively by a single 
twinning dislocation. Fig. 12 illustrates it. Here OA, OB, and OC are 
dislocation lines and ODE is the twinning plane. The twinning 
dislocation OC has its Burgers vector in this plane, and this vector is such that 
the dislocation produces the correct shear for twinning between the 
atoms bounding this plane as it sweeps through the plane, rotating 
about its point of emergence O. The dislocations OA and OB are 
immobile and each has a component in its Burgers vector along the 
normal to the twinning plane ; this component of the vector makes the 
dislocations OA and OB partly or wholly of screw character, with a 
pitch equal to the spacing of the twinning layers. Because of this, these 
twinning layers are not a number of separate planes stacked on one 
another, but the leaves of a single continuous spiral surface (a helicoid 
or Riemann surface), winding round the pole dislocation AOB like a 
spiral staircase. It follows that, as the twinning dislocation rotates 
round the pole dislocation, it not only produces a monolayer sheet of 
twinned crystal but also climbs up the "staircase" to the next layer, 
where it repeats the process. In this way a thick layer of twin can be 
built up. The time needed to make a twin by this process is similar to 
that to make a slip band by the Frank-Read mechanism, and it is known 
experimentally that deformation twins can form very quickly, in a few 
microseconds. 43, 45 

Fig. 12. The pole mechanism of deformation twinning 

COTTRELL and BILBY have applied this theory to the body centred 
cubic lattice. Suppose that the line AOB in Fig. 12 was originally a unit 
dislocation with Burgers vector $a/2[111]$ and that the part OB lies in a 
(112) plane. Under a suitable stress the reaction 18 can occur at OB, 
leaving OB itself as a sessile dislocation $a/3[112]$ and producing the 
twinning dislocation OC, $a/6[11\bar{1}]$. If OC turns into a twinning plane 
ODE which is $(\bar{1}21)$ or $(2\bar{1}1)$ it can rotate in this about the pole 
dislocation and make a twin. 
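
The requirement that the pole dislocation has a pitch equal to the spacing of the twinning layers can be verified for the body centred cubic case. In the sketch below, which is ours and not taken from COTTRELL and BILBY, the twinning planes are assumed to be (-1,2,1) and (2,-1,1), the twinning dislocation is taken as a/6[1,1,-1], and the spacing of {112} planes is a/√6; the function name is hypothetical.

```python
import math


def component_along_normal(b, plane):
    """Component of vector b (units of a) along the normal of plane (hkl)."""
    norm = math.sqrt(sum(h * h for h in plane))
    return sum(bi * hi for bi, hi in zip(b, plane)) / norm


pole = (0.5, 0.5, 0.5)               # a/2[111], the pole dislocation
twinning = (1 / 6, 1 / 6, -1 / 6)    # assumed twinning dislocation
d_112 = 1 / math.sqrt(6)             # spacing of {112} planes for a = 1

pitches = []
in_plane_parts = []
for plane in ((-1, 2, 1), (2, -1, 1)):
    pitches.append(component_along_normal(pole, plane))
    in_plane_parts.append(component_along_normal(twinning, plane))

print(pitches, d_112, in_plane_parts)
```

Both planes give a pitch of a/√6, one layer spacing per revolution, while the twinning vector has no component out of either plane, as the mechanism requires.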

A similar analysis can be made for the face centred cubic lattice, 
starting from the reaction 14, but in this case only a twin monolayer 
can be formed, not a thick twin, because the OB part of the pole 
dislocation, $a/3[111]$, has to lie in the (111) twinning plane and the 
$a/6[11\bar{2}]$ twinning dislocation runs into it at the end of one revolution 
in this plane. This process may be important in the production of single 
stacking faults in this lattice, a problem which has also been considered 
from the dislocation standpoint by CHRISTIAN. 46 Pole mechanisms for 
the growth of deformation twins in hexagonal metals have been 
suggested recently, 42, 43 and in these the pole dislocation has a Burgers 
vector along the hexagonal axis. It is probable that shear 
transformations of the martensitic type occur by the movement of dislocations, 47 
and FRANK 48 has recently discussed the crystallography of the austenite- 
martensite transformation from this standpoint. The problem is 
complicated by the fact that the deformation is not a single shear. 49 

INTERACTIONS OF DISLOCATIONS 

Three kinds of interactions involving dislocations have now been 
recognized, two of which are elastic in nature while the third is 
geometrical. The first kind of elastic interaction is the ordinary long- 
range one discussed in T.D.I. We call this the "Hookean" interaction 
because it depends on stress fields round dislocations which in turn 
depend on Hooke's law being obeyed at small strains. Recent work on 
Hookean interactions has been concerned either with confirming existing 
results by methods of greater rigour and power, 50-54 or with developing 
new results, mainly about arrays of dislocations in slip planes or in crystal 
boundaries. Non-Hookean interactions originate in deviations of the 
atomic law of force in the slip planes from Hooke's law, and are impor- 
tant when considering the stress necessary to drive a dislocation 
through a grain boundary, or a sheet of stacking fault, or any similar 
inhomogeneity in the material. Geometrical interactions occur when 
intersecting dislocations in different glide systems cut through one 
another. The "jogs" that are then formed in these dislocations have 
some interesting properties; in particular jogs in screw dislocations 
create vacant atomic sites or interstitial atoms when these dislocations 
are moved. We shall discuss some of the more interesting of these 
various effects. 

DISLOCATIONS IN CRYSTAL BOUNDARIES 

The idea that grain boundaries between crystals inclined at small 
angles to each other are composed of dislocations has already been 
discussed in T.D.I. It has attracted interest because the properties of 
these small-angle boundaries can be measured experimentally and so 
form a testing ground for the theory. SHOCKLEY and READ 55 have been 
mainly responsible for developing this line of approach. 
There are three properties capable of experimental study : 

(a) The relation between the spacing of dislocations in the boundary 
and the angle between the grains. 

(b) The mobility of the boundary under stress. 

(c) The energy of the boundary, and its variation with the angle 
between the grains. 

We shall discuss these from the standpoint of the classical Burgers 
tilt boundary shown in Fig. 13. Here two crystals are joined along the 
plane $x = 0$ after being rotated by equal and opposite amounts about 
the y axis. The angle of rotation is 

$$\theta = 2 \tan^{-1}(b/2h) \approx b/h, \qquad (19)$$

where h is the spacing of edge dislocations of strength b in the boundary. 
When this angle is sufficiently small ($< 10^{-3}$ radians) the spacing 
between the dislocations is large enough to be resolvable in a 
microscope. SHOCKLEY and READ pointed out that the dislocation structure 
of such boundaries has in fact already been resolved by LACOMBE 56 in 
a study of "veining" in aluminium. LACOMBE found that a network of 
faint lines existed in a crystal which consisted of rows of separate etch 
pits, and that there was a small difference in orientation between the 
regions separated by these "veins." SHOCKLEY and READ suggested 
that the etch pits were formed at the points of emergence of the 
dislocations in the boundaries. The order of magnitude of the angle $\theta$ across 
such a boundary was $10^{-3}$ radians or less, while the spacing of the etch 
pits was not more than about $3 \times 10^{-4}$ cm. Further work along these 
lines should enable a rigorous check of equation (19) to be made. 
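
The orders of magnitude involved in equation (19) can be seen from a short calculation. The sketch below assumes a Burgers vector of 2.86 × 10⁻⁸ cm, the nearest-neighbour distance in aluminium; this value and the function names are ours and are not quoted in the text.

```python
import math


def tilt_angle(b, h):
    """Exact form of equation (19): angle (radians) for spacing h."""
    return 2.0 * math.atan(b / (2.0 * h))


def dislocation_spacing(b, theta):
    """Inverse of equation (19): spacing h for a given tilt angle."""
    return b / (2.0 * math.tan(theta / 2.0))


b_al = 2.86e-8   # cm, assumed Burgers vector for aluminium

spacings = {}
for theta in (1e-3, 1e-4):
    h = dislocation_spacing(b_al, theta)
    spacings[theta] = h
    print(f"theta = {theta:g} rad: h = {h:.2e} cm (b/theta = {b_al / theta:.2e} cm)")
```

On this assumption an angle of 10⁻⁴ radians corresponds to a spacing of about 3 × 10⁻⁴ cm, comparable with the upper limit of the etch pit spacing quoted above, and the small-angle form b/h is indistinguishable from the exact expression.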

The dislocations of the simple boundary of Fig. 13 are geometrically 
free to glide as a unit along their slip planes, although this property does 
not extend to some of the more complicated boundaries containing two or 
more kinds of dislocations. 55 Moreover, the force from an applied stress 
on any one of the dislocations in this simple boundary is exactly the same 
as it would be (T.D.I, equation (1)) if this dislocation were alone present 
in the crystal, and so the boundary should move when a small shear 
stress is applied to the crystal. Boundaries of this simple and mobile 
kind are frequently formed during the bending and cutting of hexagonal 
crystals, 57 where they are called bend planes, and direct visual 
observation of their movement under stress has now been obtained. 58 

The reason why the boundary of Fig. 13 is mechanically stable 
was given in T.D.I. When the horizontal distance between two parallel 
edge dislocations of the same sign is smaller than the (vertical) distance 
between their slip planes, they are attracted by their Hookean inter- 
action along the slip direction and pull into alignment, one above the 
other. The strain energy is smaller for the arrangement of dislocations 
in Fig. 13 than for other arrangements. The boundary in this diagram 
is a surface across which each crystal exerts forces on the other. These 
forces are associated with local indentations along the lines where a step 
on one face is pressed into the other face. These steps occur at intervals 
h up the face, so that the forces alternate in sign along the y axis with a 
period h. Elasticity theory shows that periodic forces of this kind, 
with zero resultant and moment, produce stress fields that penetrate 
inwards from the surface hardly beyond a distance of order h. 

Now the strain energy of a dislocation is logarithmically proportional 
to the radius of its stress field (T.D.I, equation (7)). It follows that the 
strain energy of a dislocation in the boundary is proportional to log h, 
and this is smaller than could be obtained by any other distribution of 
the dislocations in the crystal. Dislocations occur at intervals h along 
the boundary so that the elastic energy of unit area of the boundary is 
proportional to $(1/h) \log h$, i.e. to $-\theta \log \theta$. The energy contributed by 
the cores of the dislocations is proportional to the number of these, i.e. 
to $\theta$. Hence the surface energy of the boundary should increase with 
angle according as the function $\theta(A - \log \theta)$, where A is a constant. 55 

Fig. 13. The simple Burgers boundary 

A general formula for the stress field of the boundary can be found 
by adding together the fields of the dislocations in it. 55, 59 This gives the 
shear stress acting on the slip plane of one of the dislocations in the 
slip direction as 

$$\sigma_{xy} = \frac{\mu b}{2(1-\nu)h}\,\frac{\pi x}{h}\,\operatorname{cosech}^2\frac{\pi x}{h}, \qquad (20)$$


where x is the distance in the slip direction from the boundary. We can 
use this to find the strain energy of the boundary. Suppose that we 
start with a single crystal and make the boundary by (a) cutting from 
one side to the centre along every slip plane that is to contain a 
dislocation, and (b) applying to the two faces of each of these cuts the force 
needed to displace one face rigidly past the other by the distance b. In 
this process all dislocations in the boundary are to be created simul- 
taneously so that the applied forces are the same on all cuts. The total 
work done by these forces is the energy of the boundary. We know the 
magnitude of the force at every point on the cut faces from equation (20), 
and also at every stage during the creation of the boundary because, 
from Hooke's law, the displacement of the cut surfaces increases linearly 
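with the force on them. Thus the strain energy per dislocation is 
$\tfrac{1}{2} \times$ force $\times$ displacement, integrated along the slip plane, i.e. 

$$\tfrac{1}{2}\int_{x_0}^{x_\infty} \sigma_{xy}\, b\, dx \simeq \frac{\mu b^2}{4\pi(1-\nu)}\left[1 + \log\frac{h}{2\pi x_0}\right],$$

where $x_0$ is the radius of the core of the dislocation, $x_\infty$ is the length 
of the cut plane, and where equation (20) has been used for $\sigma_{xy}$. 
Introducing $\theta$ for $b/h$, the energy of unit area of boundary is 

$$E = \frac{\mu b}{4\pi(1-\nu)}\,\theta\,(A - \log\theta), \quad \text{where } A = 1 + \log(b/2\pi x_0). \qquad (21)$$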

This result is a special case of a more general one derived by SHOCKLEY 
and READ for boundaries containing more than one kind of dislocation. 
The contribution of the core energies can be included in the formula 
by suitably altering the magnitude of $A$, and hence $x_0$. Using the 
dislocation model of PEIERLS and NABARRO (T.D.I, page 90), VAN DER 
MERWE 60 has shown that A is typically about 0.5. 
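
The closed form for the stress field of the boundary can be checked by direct summation over the dislocations. The sketch below rests on two assumptions of ours: that a single edge dislocation at height y above the slip plane contributes a shear stress proportional to x(x² − y²)/(x² + y²)², as in the usual isotropic result, and that equation (20) has the form [μb/2(1−ν)h](πx/h)cosech²(πx/h). Stresses are in units of μb/2π(1−ν) and the function names are hypothetical.

```python
import math


def wall_stress_sum(x, h, n_terms=100_000):
    """Shear stress on the slip plane y = 0 from dislocations at y = n*h."""
    total = 1.0 / x                      # the dislocation in the slip plane itself
    for n in range(1, n_terms + 1):
        y2 = (n * h) ** 2
        # Dislocations at +n*h and -n*h contribute equally.
        total += 2.0 * x * (x * x - y2) / (x * x + y2) ** 2
    return total


def wall_stress_closed(x, h):
    """Assumed closed form of equation (20), in the same units."""
    eta = math.pi * x / h
    return (math.pi / h) * eta / math.sinh(eta) ** 2


h = 1.0
comparison = {x: (wall_stress_sum(x, h), wall_stress_closed(x, h)) for x in (0.3, 1.0)}
for x, (summed, closed) in comparison.items():
    print(x, summed, closed)

# Ratio of the wall stress to that of a single dislocation at x = 2h.
screening = wall_stress_closed(2.0 * h, h) / (1.0 / (2.0 * h))
print(screening)
```

The last number shows how quickly the field of the wall dies away: at a distance of two spacings it is already less than a thousandth of that of an isolated dislocation, which is the short range invoked in the argument above.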
The measurement of relative grain boundary energies in solids has 
been made possible by the methods of SMITH 61 and SHUTTLEWORTH, 
KING and CHALMERS, 62 which determine the relative surface tensions of 
boundaries from the angles at which three of them meet along a line. 
These have been used to test the $\theta(A - \log \theta)$ law by DUNN and 
LIONETTI 63 and AUST and CHALMERS. 64 Fig. 14 shows the results of 
AUST and CHALMERS on lead compared with such a curve, using the 
value A = 0.2. 
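
The shape of the θ(A − log θ) curve is easily tabulated. The following sketch assumes that θ is measured in radians and that natural logarithms are meant; neither convention is stated in the text for the value A = 0.2, so the angle of the maximum found here should be read as illustrative. The function name is ours.

```python
import math


def boundary_energy(theta, A):
    """theta*(A - log(theta)) with theta in radians; arbitrary energy units."""
    return theta * (A - math.log(theta))


A = 0.2
# Setting the derivative A - log(theta) - 1 to zero gives the maximum.
theta_peak = math.exp(A - 1.0)
peak_degrees = math.degrees(theta_peak)
peak_energy = boundary_energy(theta_peak, A)

print(f"maximum at {peak_degrees:.1f} degrees")
for degrees in (1, 5, 10, 20, 30):
    relative = boundary_energy(math.radians(degrees), A) / peak_energy
    print(degrees, round(relative, 3))
```

Under these conventions the energy rises steeply at small angles and flattens towards a maximum near 26 degrees, where it equals θ in the same units.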

It is surprising that there is agreement not only at low angles, where 
it might be expected, but also at large angles where the theory is no 
longer reliable. LOMER and NYE 65 have shown with the bubble raft 
(T.D.I) that the nature of the boundary changes rapidly, when $\theta$ 
exceeds a few degrees, from one made of dislocations to one in which 
dislocations cannot be recognized. In particular the dislocations, which 
are wide (T.D.I, page 91) when $\theta$ is small, contract when $\theta$ approaches 
the range where the spacing between dislocations is equal to their 
width. This contraction changes them into defects resembling in some 
ways rows of vacant sites. 

Fig. 14. Variation of boundary free energy with difference in 
orientation (in degrees) between crystals of lead; (1) theoretical curve 
with A = 0.2; O experimental values of AUST and CHALMERS 

NABARRO's 19 explanation of the contraction is as follows. It is 
possible to regard an ordinary dislocation as a narrow assemblage in 
the slip plane of many dislocations of infinitesimal strengths. 66 The 
width $\beta$ of the dislocation is governed by the elastic repulsions of these 
infinitesimal dislocations and by the misfit energy in the strip of slip 
plane containing it. Now a wall of dislocations such as that in Fig. 13 
can be regarded as a set of parallel and closely spaced walls, the 
component dislocations of which are the infinitesimal dislocations which 
define $\beta$. This becomes important when the spacing h is smaller than 
$\beta$, for then $\beta$ is determined by the elastic repulsion of walls of infinitesimal 
dislocations. As equation (20) shows, the stress field of a wall of 
dislocations has a shorter range than that of a single one. The walls of 
infinitesimal dislocations therefore move closer together as $\theta$ is increased 
into the range where $h \sim \beta$, which is the effect that LOMER and NYE 
observe. 

DISLOCATIONS IN SLIP BANDS 

The slip displacement at one end of a long slip plane can differ sub- 
stantially from that at the other end, as may be seen from the fact that 
the arrangement of slip bands on one side of a worked crystal is often 
quite different in detail from that on the other side. This difference in 
displacement is taken up by a set of dislocations spaced through the 
slip plane. Various aspects of such arrays of dislocations are interesting. 
On the purely formal side, it is possible to regard the whole set of 
dislocations as a single dislocation of a more general kind, with a Burgers 
vector that is not constant along the slip plane. 67-70 Another approach 
is that of NYE, 71 who, by a photoelastic study of internal stresses in 
deformed silver chloride crystals, has shown very clearly that some of 
these stresses are caused by rows of edge dislocations, all of the same 
sign, contained in bent slip planes. 

Fig. 15. A row of edge dislocations pressed against an obstacle 

In T.D.I, page 105, it was shown that a row of n edge dislocations, all 
of the same sign, which are pushed against an obstacle in the slip 
plane exert a strong pressure against it (Fig. 15); in particular the 
leading dislocation in the sequence is driven against it by a stress n 
times the applied stress. It was also pointed out that the dislocations 
are spaced more closely together at the obstacle end of the sequence, 
and that because they are all of the same sign they bend the slip 
plane. The point of interest about this arrangement of dislocations is 
that it represents the spearhead of a slip band held up by an obstruction. 
The pressure exerted by the dislocations amounts in effect to a concen- 
tration of shear stress at the end of the slip band, and this concentration 
may enable the slip band to push through the obstacle into the crystal 
beyond it. 72 

This stress concentration depends upon L, the length of the slip 
plane from the source S to the obstacle P, since this is a factor on 
which the number of dislocations piled up depends. We can estimate n 
roughly as follows. Suppose that the source becomes active when an 
arbitrarily small stress acts on it, and that this stress is also sufficient 
to make dislocations glide in the plane. If a stress $\sigma$ is applied to the 
crystal the source feeds dislocations into the slip plane until the back 
stress from these cancels the applied stress. Roughly, then, the source 
stops working when the applied stress is relaxed to zero inside an 
approximately circular region of radius L centred on the source. The 
elastic strain in this region is correspondingly reduced from the value 
$\sigma/\mu$ to zero, and this is accomplished by a slip displacement $2L(\sigma/\mu)$ 
at the source. To produce this displacement by dislocations of strength 
$b$ the number required is 

$$n \simeq 2L\sigma/\mu b. \qquad (22)$$

A more accurate theory has been given by ESHELBY, FRANK and 
NABARRO, 73 which enables the positions of the dislocations to be found. 
These positions, denoted by $x_i$ ($i = 1, 2, \ldots, n$), are defined as the 
values of the distance $x$ along the slip plane at which the polynomial 

$$f(x) = (x - x_1)(x - x_2) \cdots (x - x_n) \qquad (23)$$

is zero. The equilibrium of the dislocations can be stated in the form of a 
differential equation for f(x) the solution of which is a polynomial equal 
to zero at n different values of x. In the problem of Fig. 15 it turns out 
that the distribution of the n dislocations along the slip plane is 
identical with that of the radial distribution of nodes in the ns state 
of the hydrogen atom. 

For large values of n the following results are obtained : 73 

(a) The number of dislocations packed into a length L of the slip 
plane is 

$$n = \pi L \sigma k/\mu b, \tag{24}$$

where k is unity for a screw dislocation, $(1 - \nu)$ for an edge, and $\nu$ is 
Poisson's ratio. This compares with our equation (22) above. 

(b) The distance between the two leading dislocations is 

$$d = 1.84\,(\mu b/2\pi k n \sigma). \tag{25}$$



(c) The shear stress on the glide plane at a distance x ahead of the 
leading dislocation is practically the same as that due to this dislocation 
alone in the region where $x \ll d$. At distances where $d \ll x \ll L$ the 
stress concentration factor is approximately 

$$1 + (L/x)^{1/2}. \tag{26}$$

This stress concentration can be large, and the dislocations then produce 
the same stress concentration as does a freely slipping crack of the same 
length. 74 

(d) The total slip produced by the sequence is the same as if a single 
dislocation of strength nb were moved a distance 3L/4. 
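
The pile-up results quoted above lend themselves to a quick numerical check. The following Python sketch evaluates the rough estimate (22) and the large-n results (24), (25) and (26). The function names and the input values (shear modulus, Burgers vector, applied stress and slip-plane length, all in c.g.s. units) are assumptions chosen for illustration and are not taken from the text.

```python
import math


def rough_count(L, sigma, mu, b):
    """Rough estimate, equation (22): n ~ 2 L sigma / (mu b)."""
    return 2.0 * L * sigma / (mu * b)


def pileup_count(L, sigma, mu, b, k=1.0):
    """Large-n result, equation (24): n = pi L sigma k / (mu b).

    k = 1 for a screw dislocation, k = 1 - nu for an edge dislocation.
    """
    return math.pi * L * sigma * k / (mu * b)


def leading_spacing(n, sigma, mu, b, k=1.0):
    """Equation (25): spacing of the two leading dislocations."""
    return 1.84 * mu * b / (2.0 * math.pi * k * n * sigma)


def stress_concentration(L, x):
    """Equation (26): factor 1 + (L/x)**0.5, meaningful only for d << x << L."""
    return 1.0 + math.sqrt(L / x)


# Assumed illustrative inputs (c.g.s. units); not data from the text.
MU = 4.0e11          # shear modulus, dyne cm^-2
B = 2.5e-8           # dislocation strength, cm
SIGMA = 1.0e-3 * MU  # applied shear stress, dyne cm^-2
LENGTH = 1.0e-3      # source-to-obstacle distance L, cm

n_rough = rough_count(LENGTH, SIGMA, MU, B)
n_screw = pileup_count(LENGTH, SIGMA, MU, B, k=1.0)
d_screw = leading_spacing(n_screw, SIGMA, MU, B, k=1.0)

if __name__ == "__main__":
    print(f"n from (22): {n_rough:.1f}, n from (24): {n_screw:.1f}")
    print(f"leading spacing d: {d_screw:.2e} cm")
    print(f"concentration factor at x = L/100: {stress_concentration(LENGTH, LENGTH / 100):.1f}")
```

For a screw dislocation the two estimates of n differ only by the constant factor $\pi/2$, which is the sense in which (24) "compares with" (22).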

FRANK 75 has suggested that this high concentration of stress at the 
end of a piled up row of dislocations may be sufficient in a poly- 
crystalline specimen to force the creation of dislocations on the other 
side of a grain boundary which is holding up the row. In the next 
section we shall turn to consider the nature of the force preventing the 
leading dislocation from entering the grain boundary. 

Note added in proof: D. KUHLMANN-WILSDORF, J. H. VAN DER MERWE, 
and H. WILSDORF, [Phil. Mag. 43 (1952) 632] have recently obtained experi- 
mental evidence supporting the theory of dislocations in slip bands. With 
the aid of an electron microscope they measured the spacing, length, and 
amount of glide, of individual fine slip lines formed in lightly strained alu- 
minium. The number of dislocations in an elementary slip line was calculated 
from these measurements, and it was shown that the observed spacing of the 
lines is comparable with the theoretical critical distance below which rows 
of dislocations in parallel slip planes cannot pass each other. 

OBSTRUCTION OF DISLOCATIONS AT GRAIN BOUNDARIES 

As a dislocation approaches a grain boundary it may be subjected to 
three forces, one long-range and two short-range ones. The long-range 
force occurs where there is a change of elastic constants beyond the 
boundary, for example, where the boundary is a free surface, or where 
the boundary is between two phases of an alloy, or between two crystals 
in an elastically anisotropic material. The force occurs because, as the 
dislocation approaches the boundary, more of its strain field is imposed 
on the material beyond, and the energy to deform this is different 
from that to deform the material on the near side of the boundary. The 
only case that has been studied in detail is that of a free surface. 
KOEHLER 76 showed that a dislocation is attracted towards this by an 
"image" force inversely proportional to the distance from the surface. 

When applying this result in practice it should be remembered that 
many surfaces are covered with films, e.g. oxide coats on metals. The 
presence of an adherent film turns the boundary of the crystal into an 
interface between phases, and this will generally repel dislocations 
approaching it from inside the crystal. There is much experimental 
evidence for the fact that oxide coats make slip more difficult in soft 
metal crystals. 77 A striking indication of the pressure of piled up 
dislocations against such oxide coats has been given by BARRETT ; 78 he 
showed that if the oxide coat were removed from a zinc crystal that had 
recently been twisted plastically, the crystal would spontaneously twist 
a little further as the piled-up dislocations were allowed to run out. 

The two short-range forces acting on a dislocation approaching a 
boundary are that due to the stress field of the dislocations in a boundary 
between different crystals, and one due to the fact that the law of 
force between atoms in the slip plane changes at the boundary. This 
latter is an example of a non-Hookean interaction and we shall use it to 
illustrate the general theory of such interactions, developed by 
JASWON and FOREMAN. 79 

Suppose in Fig. 16 that a dislocation in the slip plane PQ is approach- 
ing a boundary AB beyond which the law of force between the atoms 
in the slip plane is different. In fact QR need not be a real slip plane 
at all, but merely a geometrical continuation into the region beyond the 
boundary. To make a definite picture, suppose that AB is the boundary 
between two elastically isotropic crystals that differ only in orientation 
from one another. Then for small strains the law of force between the 
atoms across the slip plane is the same in PQ and QR, and is Hooke's 
law $\sigma_{xy} = \mu\phi/d$, where $\sigma_{xy}$ is the shear stress, $\phi$ is the shear 
displacement and d is the spacing of the planes. For large strains the force 
is larger in QR than in PQ, since QR is not physically suitable for slip. 

Fig. 16. A dislocation approaching a boundary 

At some distance from it the dislocation produces strains in the 
boundary that are small enough for Hooke's law to be obeyed. There 
is no Hookean distinction between PQ and QR, so that no force is 
caused by the change in the nature of the slip plane until the disloca- 
tion is close enough for Hooke's law to be no longer obeyed at the 
boundary. This non-Hookean force is thus a short-range one. 

JASWON and FOREMAN give a theorem for calculating s, the distance 
to which a dislocation can be pushed towards the boundary by an 
applied stress $\sigma$. Later, NABARRO IS& showed that this could be derived 
rather simply by applying the principle of virtual work. Consider a 
piece of slip plane lying between x and x + dx from the dislocation, 
and denote the energy in this piece by $g(\phi)\,dx$ when x < s, and by 
$h(\phi)\,dx$ when x > s. The shear displacement $\phi$ depends on x, and the 
laws of force in the two regions of the slip plane are $\sigma_{xy} = dg/d\phi$ and 
$\sigma_{xy} = dh/d\phi$, respectively. Let the entire field of displacement 
representing the dislocation be moved forward rigidly by a small 
distance dx. The strain energy in the (Hookean) regions outside the 
slip plane is not altered, while the energy in the slip plane is altered by 
the same amount as if the boundary AB were moved from s to s - dx. 
This energy change is $\{h(\phi_s) - g(\phi_s)\}\,dx$, where $\phi_s$ is the value of $\phi$ at s. 
The work done by the applied stress in moving the dislocation is 
$\sigma b\,dx$ per unit length. Hence the condition for equilibrium is 

$$\sigma b = h(\phi_s) - g(\phi_s). \tag{27}$$

This is an equation for s and to solve it we have first to choose suitable 
functions for $g(\phi)$ and $h(\phi)$. JASWON and FOREMAN use the Peierls 
sinusoidal law (T.D.I, page 90) for g and Hooke's law for h. Next we 
have to determine $\phi_s$ as a function of s; the simplest procedure here is 
to take an average of the values for $\phi$ obtained by assuming in turn 
that $g(\phi)$ or $h(\phi)$ is valid throughout the whole slip plane. 

Using the above laws of force JASWON and FOREMAN show that the 
stress needed to hold the dislocation at s is given approximately by 

$$\sigma = (\mu/4\pi^2)(\beta/s)^4, \tag{28}$$

where $\beta = b/2(1 - \nu)$. The short-range character of this interaction, 
varying as $s^{-4}$, is to be noted. As a result, the stress needed to hold a 
dislocation at 10 atom spacings from the boundary is only $10^{-5}\mu$, 
whereas that needed to hold it at 1 atom spacing is nearly as large as 
the theoretical shear strength of the lattice, $\mu/30$. 

MECHANICAL HYSTERESIS IN POLYCRYSTALS 

One of the effects of the obstruction of dislocations at grain boundaries 
is that a polycrystal should show mechanical hysteresis. 80 Other 
obstacles such as mosaic boundaries, oxide films, and stacking faults, 
may act in the same way and much of the following discussion can also 
be applied to them. 

For simplicity we suppose that the applied stress is only sufficient 
to produce slip in a few favourably oriented grains, although the 
theory can also be applied to larger deformations if attention is 
concentrated on the excess of slip in one grain over that in its neighbours. 
When the proportion of plastic grains is small it is fair to assume that 
each of these behaves as if it were mounted in an elastic matrix. Under 
these circumstances the maximum slip it can undergo is that which 
relaxes the stress in it from the initial value $\sigma$ to some lower one $\sigma_i$, 
which is the critical stress below which dislocations are no longer 
mobile. If the linear dimension of the grain is L the total slip 
displacement in it is of order $L(\sigma - \sigma_i)/\mu$, and if the proportion of such grains 
in the specimen is $\alpha$ the overall strain produced by slip in them is of 
the order 

$$\alpha(\sigma - \sigma_i)/\mu, \tag{29}$$

when $\alpha \ll 1$. 

The stress $\sigma_i$ takes account of the various obstacles, such as foreign 
atoms and precipitated particles, that a dislocation has to overcome 
when gliding through its slip plane. We can regard this stress as acting 
like a frictional force which prevents a dislocation from moving in either 
direction unless the applied stress is greater than it. 

This model has been used by NABARRO in the following way to 
discuss mechanical hysteresis. 80 Suppose that the applied stress is 
increased from zero towards a value $\sigma$ which exceeds $\sigma_i$ in a few of the 
grains. At first the deformation is purely elastic. When the stress 
reaches $\sigma_i$ dislocations begin to move and pile up at the boundaries of 
the plastic grains, their mutual repulsion producing a back stress $\sigma_d$. 
At the applied stress $\sigma$ dislocations move up to the boundaries until the 
back stress acting on each of them reaches the value given by 
$\sigma_d = \sigma - \sigma_i$. Plastic flow then stops. On unloading no plastic flow can 
occur until the applied stress is reduced by $2\sigma_i$, so that the unloading 
curve starts off as a straight line. When the applied stress is reduced 
to $\sigma - 2\sigma_i$ the dislocations begin to move backwards into the centre of 
the grain, giving a reversed plastic flow. This becomes possible because 
the actual stress acting in this direction on each one of them is now 
$\sigma_d - (\sigma - 2\sigma_i)$ which, since $\sigma - \sigma_i = \sigma_d$, is equal to $\sigma_i$. 

When $\sigma > 2\sigma_i$ the applied stress $\sigma - 2\sigma_i$ is positive, so that reversed 
plastic flow begins before the specimen has been fully unloaded. 
On the other hand, when $\sigma < 2\sigma_i$ the specimen unloads completely 
without undergoing reversed plastic flow. Two effects can then be 
observed. If the applied stress is taken through zero and increased in 
the reverse direction, the elastic limit at which flow begins in this 
direction is $2\sigma_i - \sigma$, which is numerically smaller than the original 
elastic limit, $\sigma_i$. This is the Bauschinger effect. Instead of loading in the 
reverse direction, the specimen can be rested in the unloaded state. 
The stress $\sigma_d$ $(= \sigma - \sigma_i)$ is then not strong enough to overcome the 
frictional stress $\sigma_i$ due to the obstacles in the slip plane, but even so the 
dislocations may occasionally surmount these obstacles with the aid of 
thermal agitation. This is an example of the elastic after-effect. 
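
The bookkeeping of this friction-stress model can be made explicit in a few lines. The sketch below is only an illustration of the argument above: given the peak applied stress and the friction stress, it returns the back stress at the end of loading, the applied stress at which reversed flow starts, and the elastic limit met on reverse loading. The function name, the dictionary keys and the sample stresses (in arbitrary units) are assumptions introduced here.

```python
def unloading_response(sigma, sigma_i):
    """Unloading behaviour of a plastic grain after loading to sigma.

    sigma   : peak applied stress (must exceed sigma_i for any slip)
    sigma_i : friction stress opposing dislocation motion in either direction
    """
    if sigma_i <= 0.0 or sigma <= sigma_i:
        raise ValueError("need sigma > sigma_i > 0 for plastic flow to occur")
    back_stress = sigma - sigma_i    # sigma_d when forward flow stops
    onset = sigma - 2.0 * sigma_i    # applied stress at which reverse flow begins
    return {
        "back_stress": back_stress,
        "reverse_flow_onset": onset,
        # True when reverse flow starts before the specimen is fully unloaded
        "reverse_flow_before_full_unloading": onset > 0.0,
        # magnitude of the reverse stress needed for flow (Bauschinger effect)
        "reverse_elastic_limit": max(0.0, -onset),
    }


if __name__ == "__main__":
    for peak in (1.5, 3.0):  # arbitrary units, with sigma_i = 1.0
        print(peak, unloading_response(peak, 1.0))
```

With a friction stress of 1.0, a peak stress of 1.5 gives a reverse elastic limit of 0.5, smaller than the original limit, whereas a peak stress of 3.0 gives reversed flow while the applied stress is still positive.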

KUHLMANN 81 has shown that this kind of model can be used to 
explain why the residual strain in the after-effect changes 
logarithmically with time. The stress $\sigma_d$ depends on the number n of dislocations 
piled against the boundary. The activation energy for the return of a 
dislocation is a function of $\sigma_i - \sigma_d$ and hence of n; for small changes in 
n it can be written in the form $W - \beta n$, where W and $\beta$ are constants. 
The rate of change of n due to the movement of dislocations back into 
the grains is $dn/dt = -A \exp\{-(W - \beta n)/kT\}$, where A is a constant. 
This integrates to give 

$$n = n_0 - (kT/\beta) \log(1 + t/t_0), \tag{30}$$

where $n_0$ is the initial value of n and $t_0$ is defined by 

$$\log(\beta A t_0/kT) = (W - \beta n_0)/kT. \tag{31}$$

The residual strain is proportional to n and hence depends 
logarithmically on time. 
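
The logarithmic law can be checked by integrating the rate equation directly. The sketch below assumes the rate law dn/dt = -A exp{-(W - beta n)/kT} and compares a fourth-order Runge-Kutta integration with the closed form n = n0 - (kT/beta) log(1 + t/t0), where t0 = (kT/(beta A)) exp{(W - beta n0)/kT}. All parameter values are dimensionless and purely illustrative, and the function names are introduced here.

```python
import math


def n_closed_form(t, n0, W, beta, A, kT):
    """Closed-form solution n(t) of the rate law, with its time constant t0."""
    t0 = (kT / (beta * A)) * math.exp((W - beta * n0) / kT)
    return n0 - (kT / beta) * math.log(1.0 + t / t0)


def n_numerical(t_end, n0, W, beta, A, kT, steps=3000):
    """Integrate dn/dt = -A exp(-(W - beta n)/kT) by classical RK4."""

    def rate(n):
        return -A * math.exp(-(W - beta * n) / kT)

    n = n0
    dt = t_end / steps
    for _ in range(steps):
        k1 = rate(n)
        k2 = rate(n + 0.5 * dt * k1)
        k3 = rate(n + 0.5 * dt * k2)
        k4 = rate(n + dt * k3)
        n += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return n


# Assumed dimensionless parameters, chosen only to exercise the formulae.
PARAMS = dict(n0=10.0, W=10.0, beta=0.5, A=1.0, kT=1.0)

if __name__ == "__main__":
    for t in (30.0, 300.0, 3000.0):
        print(t, n_numerical(t, **PARAMS), n_closed_form(t, **PARAMS))
```

Each tenfold increase in t beyond t0 removes roughly the same number of dislocations, which is the logarithmic recovery of the residual strain.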

DISLOCATION JOGS AND THE CREATION OF VACANCIES 

We turn now to the third kind of interaction involving dislocations, the 
geometrical interaction which occurs when two non-parallel dislocation 
lines cut through each other. HEIDENREICH and SHOCKLEY 25 pointed 
out that this would be a source of hardening in crystals. 

Consider two straight dislocation lines A and B, as in Fig. 17 (a). 
Suppose that dislocation A glides in its plane and cuts through B, as 
in diagram (b). Then on each dislocation a jog will be formed, at P and 
Q. For instance, the part of B above the slip plane of A is moved, being 
carried along in the top half-crystal, relative to the part below the plane 
when the dislocation A passes through. The direction and magnitude 
of this displacement are equal to the Burgers vector of dislocation A. 
Since the Burgers vector must be constant along a single dislocation 
line, and since this line cannot end within the crystal (T.D.I, page 80), 
the displaced ends of dislocation B are necessarily connected by a 
short piece of dislocation, the jog, which lies in the slip plane of A and 
has the same Burgers vector as B. To summarize, a jog is a dislocation 
with the same Burgers vector as the dislocation line which contains it, 
while the length and direction of the line of the jog are those of the 
Burgers vector of the dislocation that has cut through. 

Fig. 17. The formation of dislocation jogs 

A jog will usually be one atom long. Now the energy of a dislocation 
is about $\mu b^3$ per atom plane (T.D.I, page 116), so that the energy of a 
jog is $\alpha\mu b^3$, where $\alpha \sim 1$. To calculate the stress needed to make a 
jog consider the situation shown in Fig. 18, where a dislocation line is 
about to cut through a row of equidistant dislocations spaced l apart. 

If this dislocation moves forward a distance of, say, b, i.e. one atom 
spacing, it cuts through all the intersecting dislocations and makes $1/l$ 
jogs in unit length. The dislocation energy is increased by $\alpha\mu b^3/l$, 
but the work gained from the applied stress $\sigma$ is $\sigma b^2$. Equating these, 
the stress needed to make the jogs is 

$$\sigma = \alpha\mu b/l, \tag{32}$$

so that if the density of intersecting dislocations is $\rho$ (so that l is of 
order $\rho^{-1/2}$), the stress needed to drive a dislocation through them is 

$$\sigma = \alpha\mu b\rho^{1/2}. \tag{33}$$

Taking $\rho = 10^{12}$ cm$^{-2}$ for a heavily worked material (T.D.I, page 104), 
and $b = 3 \times 10^{-8}$ cm, this is $\alpha\mu/50$. With $\alpha = 1$ (which may be an 
overestimate) this stress is about ten times greater than that predicted 
from Taylor's theory of work hardening (T.D.I, page 106). Taylor's 
theory also leads to the relation (33) for the stress, but with an $\alpha$ of the 
order of 0.1. 

Fig. 18. A dislocation cutting through a row of dislocations 

If the applied stress is less than $\alpha\mu b/l$ the dislocation can still go 
forward with the aid of thermal fluctuations. The activation energy to 
make a jog is 

$$U \approx \alpha\mu b^3 - \sigma b^2 l. \tag{34}$$

There are two points about this: (a) since $\alpha\mu b^3$ is only a few electron 
volts (T.D.I, page 117) the activation energy is always fairly small, 
even at low stresses, so that there should be an appreciable temperature 
dependence for the hardening caused by intersecting dislocations; (b) 
the activation energy varies linearly with stress, a relation that is 
needed to explain some creep results described later; NABARRO 82 has 
shown that hardly any other process involving dislocations can give 
this linear relation. The significance of these effects for work hardening 
and creep will be discussed later. 
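
A short numerical sketch may help to fix the magnitudes involved in jog formation. It evaluates the stress alpha mu b rho^(1/2) needed to cut through a forest of density rho, and the activation energy U = alpha mu b^3 - sigma b^2 l at lower stresses. The shear modulus used here is an assumed round figure, and the function names are introduced for illustration; only rho and b follow the values quoted in the text.

```python
import math

ERG_PER_EV = 1.602e-12  # erg per electron volt


def athermal_cutting_stress(alpha, mu, b, rho):
    """Stress alpha * mu * b * sqrt(rho) needed to form jogs without thermal aid."""
    return alpha * mu * b * math.sqrt(rho)


def jog_activation_energy(sigma, alpha, mu, b, l):
    """U = alpha mu b^3 - sigma b^2 l in erg (zero once the stress suffices alone)."""
    return max(0.0, alpha * mu * b**3 - sigma * b**2 * l)


ALPHA = 1.0
MU = 4.0e11            # assumed shear modulus, dyne cm^-2 (not from the text)
B = 3.0e-8             # cm, as quoted in the text
RHO = 1.0e12           # cm^-2, as quoted in the text
SPACING = RHO ** -0.5  # mean spacing l of the intersecting dislocations, cm

sigma_c = athermal_cutting_stress(ALPHA, MU, B, RHO)
u_zero_ev = jog_activation_energy(0.0, ALPHA, MU, B, SPACING) / ERG_PER_EV
u_half_ev = jog_activation_energy(0.5 * sigma_c, ALPHA, MU, B, SPACING) / ERG_PER_EV

if __name__ == "__main__":
    print(f"cutting stress / mu = {sigma_c / MU:.3f}")
    print(f"U at zero stress    = {u_zero_ev:.2f} eV")
    print(f"U at half stress    = {u_half_ev:.2f} eV")
```

With these inputs the cutting stress is a few hundredths of the shear modulus, the same order as the figure quoted in the text, and the zero-stress activation energy is a few electron volts, falling linearly with stress.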

We must now consider whether a jog is an impediment to the move- 
ment of the dislocation containing it. Jogs in edge dislocations should 
not have much effect. The interesting case, first pointed out by 
THORNTON READ and later discussed by SEITZ 39, 83 and MOTT, 84 is that 
where two screw dislocations cross each other. If a screw dislocation 
contains a jog that is perpendicular to the screw axis then this jog is 
a piece of edge dislocation, since its line and its Burgers vector are 
perpendicular to each other. These two directions define a plane, the 
slip plane of the edge dislocation, in which it can glide easily. But this 
implies a movement of the jog along the screw axis and not a movement 
with the screw dislocation itself. Any movement of the screw, with the 
jog, in the direction perpendicular to the slip plane of the jog, involves 
either lengthening or shortening the half-plane (T.D.I, page 86) 
belonging to the edge dislocation in the jog ; this can only occur by the 
jog leaving behind itself a row of vacant atomic sites, or of atoms 
misplaced into interstitial positions, as it moves along with the screw. 
If the temperature is high enough for self-diffusion, or if the jog is 
made with sudden violence, these vacancies or interstitial atoms will be 
dispersed into the surrounding crystal. 

The purely geometrical basis of this effect can be understood with the 
aid of Fig. 19. Suppose in diagram (a) that slip is to occur along the 
lines AB and CD. Normally one would slip before the other, as in 
diagram (b), where AB has slipped before CD. However, if they both 
slip at the same time, and in the directions shown in diagram (c), a 
hole must be formed at the centre, where they cross. The essential 
condition to make the slip processes on these two planes overlap in 
time is that they occur by the movement of screw dislocations, one in 
each plane, in the crystal at the same time. 

Fig. 19. Forming vacancies by slip on two planes 

The generation of vacancies or interstitial atoms from a jog pulled 
along with a dislocation is of interest, and SEITZ 39, 83 has discussed this 
and some other processes for creating these defects. A jog in an edge 
dislocation may occasionally create a defect by a process illustrated 
in Fig. 20. Here CDE represents an edge dislocation with a jog at D, 
the half-plane being ABCDE, moving in the direction shown. The 
line CD can be regarded as an incomplete row of atoms attached to the 
lower edge of an otherwise perfect half-plane. The end of the row D is 
clearly a place where an atom is likely to become detached from the 
half-plane and left behind in an interstitial position as the dislocation 
moves on, or alternatively is picked up from the lattice and added to 
the row, in which case a vacant site is left behind. In close-packed 
metal lattices, where an interstitial atom has more energy than a 
vacancy, it is likely that the second of these processes will operate 
mostly, so that the incomplete row CD will gradually become longer and 
most of the lattice defects created will be vacancies. 

SEITZ 39, 83 and MOTT 23 have summarized the experimental evidence 
for the generation of lattice defects by cold work. The main items are 
as follows: 

1. GYULAI and HARTLY 85 showed that the electrical conductivity 
of sodium chloride is increased about 100 times after 10 per cent 
plastic strain. This conductivity is believed to be due to 
vacancies and decreases on subsequent ageing at a rate consis- 
tent with this hypothesis. 

2. MOLENAAR and AARTS 86 strained polycrystals of copper, silver, 
and aluminium, at the temperature of liquid air and measured 
the increase of resistivity caused by this. Ageing at room 
temperature removed nearly one-half of this increase without 
altering the stress-strain curve. SEITZ believes that this 
recoverable part of the resistance is due to vacancies created 
during the cold working process, and that it disappears when 
these condense into aggregates or at jogs in edge dislocations. 
From the magnitude of the resistance change he estimates that 
10 per cent plastic strain creates about $10^{18}$ vacancies per c.c. 

3. DUGDALE 87 has shown that the resistivity of platinum is increased 
about 0.5 per cent by fast neutron bombardment, an effect 
which is almost certainly caused by the creation of vacancies 
and interstitial atoms. This recovers on annealing at the same 
rate as, and with the same activation energy (1.1 eV) as, the 
recoverable resistance caused by cold work. 

Fig. 20. A jog D in an edge dislocation CE 

THEORY OF THE YIELD POINT AND STRAIN AGEING 

It was shown in T.D.I that the stresses round a dislocation can be 
relieved by a suitable arrangement of nearby solute atoms, and that a 
solute atom with polar co-ordinates $r$, $\theta$ relative to a positive edge 
dislocation has an interaction energy 

$$V = A \sin\theta / r \tag{35}$$

with it. Here A is a constant that depends on the elastic properties 
of the material, on the degree of misfit of the solute atom, and on the 
strength of the dislocation. This formula breaks down at the centre of 
the dislocation, where Hooke's law does not hold, and it is expected that 
a large solute atom is most strongly bound at the position $\theta = 3\pi/2$ and 
$r\ (= r_0) \approx 2 \times 10^{-8}$ cm which lies immediately below the half-plane 
of the dislocation. For carbon and nitrogen in $\alpha$-iron one obtains 88 
$A \approx 3 \times 10^{-20}$ dyne cm$^2$ and, at the position of greatest binding, 
$V\ (= V_{max}) \approx 1$ eV. 
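
The figure of about 1 eV follows directly from the interaction energy A sin(theta)/r with the quoted constants. The sketch below performs the unit conversion from ergs to electron volts; the function name and the conversion constant are supplied here, while A and r0 are the values quoted for carbon and nitrogen in alpha-iron.

```python
import math

ERG_PER_EV = 1.602e-12  # erg per electron volt


def interaction_energy(A, r, theta):
    """V = A sin(theta) / r, in erg when A is in dyne cm^2 and r in cm."""
    return A * math.sin(theta) / r


A_IRON = 3.0e-20  # dyne cm^2, value quoted for C and N in alpha-iron
R0 = 2.0e-8       # cm, position of strongest binding

# theta = 3*pi/2 lies immediately below the extra half-plane (attraction);
# theta = pi/2 lies above it (repulsion for a large solute atom).
v_below_ev = interaction_energy(A_IRON, R0, 1.5 * math.pi) / ERG_PER_EV
v_above_ev = interaction_energy(A_IRON, R0, 0.5 * math.pi) / ERG_PER_EV

if __name__ == "__main__":
    print(f"V below the half-plane: {v_below_ev:.2f} eV")
    print(f"V above the half-plane: {v_above_ev:.2f} eV")
```

The energy below the half-plane comes out negative and close to 1 eV in magnitude, which is the binding energy referred to as V_max.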

An approximate experimental value can be deduced from the work of 
DIJKSTRA. 89 Using an internal friction method developed by SNOEK 90 
he measured the solubility of nitrogen in iron in the presence of 
numerous dislocations introduced by cold work. At 300°C this was 0.0025 
weight per cent as compared with 0.012 weight per cent for unworked 
material, this latter being the solubility in the presence of iron nitride 
particles. According to DIJKSTRA the binding energy of nitrogen in iron 
nitride is 0.35 eV, so that $V_{max}$ in dislocations is somewhat greater than 
this, say about 0.5 eV. Another method of estimating $V_{max}$ which also 
gives 0.5 eV is discussed below. 

This interaction can lead to two kinds of segregations of solute atoms 
round dislocations. The first is the dilute "atmosphere" or Maxwellian 
distribution, defined by 

$$c = c_0 \exp(-V/kT), \tag{36}$$

where c is the concentration in a region with a binding energy V and $c_0$ 
is the overall concentration. This expression is valid when c is small, 
so that the volume change produced by the segregation is small 
compared with that produced by the dislocation. As the temperature falls 
this atmosphere becomes more concentrated and below a critical 
temperature range it condenses into a line of solute atoms, lying along the 
dislocation line at the position of maximum binding. 88 

An estimate of the temperature $T_0$ below which condensation should 
occur (if diffusion is still possible) can be made as follows. 91 Suppose 
that there is no shortage of solute atoms for dislocations, i.e. that the 
number $c_0 N$ in unit volume of the crystal is large compared with $\rho/b$, 
the number of sites to be filled along the lines of maximum binding 
in the dislocations; here N is the total number of atoms in unit volume 
of crystal, $\rho$ is the density of dislocations, and b is the atomic spacing. 
Since $\rho/bN \approx \rho b^2$, which is about $10^{-3}$ when $\rho = 10^{12}$ cm$^{-2}$ and $10^{-7}$ 
when $\rho = 10^8$ cm$^{-2}$, this condition is satisfied except in the most 
dilute solutions. Then under these circumstances equation (36) can be 
used to estimate $T_0$. At the point of maximum binding we have 
$V = V_{max}$ and require that the temperature $T_0$ should be such as to 
give c = 1 at this point. Hence 

$$T_0 = V_{max}/[k \log(1/c_0)]. \tag{37}$$

MOTT 91 has used this to estimate $V_{max}$. He argues that the yield point 
of iron is not observed above about 700°K, and that this temperature 
should be identified with $T_0$ since a condensed atmosphere is needed 
for a strong yield point; 88 then by taking $c_0 \approx 10^{-4}$ he concludes that 
$V_{max} \approx 0.5$ eV for carbon in iron. 
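
The condensation criterion is easily put into numbers. The sketch below applies the relation T0 = V_max / [k log(1/c0)] in both directions: it recovers the binding energy implied by a condensation temperature of 700°K at an atomic concentration of 10^-4, and then the condensation temperature implied by a given binding energy. The function names and the value of Boltzmann's constant in eV per degree are supplied here.

```python
import math

K_B_EV = 8.617e-5  # Boltzmann's constant, eV per degree


def condensation_temperature(v_max_ev, c0):
    """T0 = V_max / (k log(1/c0)); c0 is the overall atomic concentration."""
    return v_max_ev / (K_B_EV * math.log(1.0 / c0))


def binding_energy_from_t0(t0, c0):
    """Inverse relation: V_max = k T0 log(1/c0), in eV."""
    return K_B_EV * t0 * math.log(1.0 / c0)


v_max_estimate = binding_energy_from_t0(700.0, 1.0e-4)

if __name__ == "__main__":
    print(f"V_max from T0 = 700 and c0 = 1e-4: {v_max_estimate:.2f} eV")
    print(f"T0 for V_max = 0.5 eV, c0 = 1e-4: {condensation_temperature(0.5, 1.0e-4):.0f} deg K")
```

Because the concentration enters only through its logarithm, the estimate of V_max is not very sensitive to the value taken for c0.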

The stress needed to pull a dislocation away from its atmosphere 
depends on whether this is condensed or not. With a condensed one the 
interaction energy per unit length, for a dislocation moved a distance 
x from its line of solute, is 

$$V(x) = -A r_0/(x^2 + r_0^2). \tag{38}$$

The corresponding force on the dislocation, dV/dx, has a maximum 
at $x = r_0/\sqrt{3}$ equivalent to a (tensile) yield stress 

. .... (39) 



For carbon and nitrogen in iron this is $6 \times 10^{10}$ dyne cm$^{-2}$. An 
extrapolation 88 of observed strengths to 0°K gives $1.25 \times 10^{10}$ dyne cm$^{-2}$. 
It is expected that the observed value should be lower, since yielding 
in iron is known to begin at places of high stress concentration, and to 
spread from these through the rest of the specimen in the form of 
Lüders bands. 
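
The position of the force maximum can be confirmed numerically. The sketch below takes the interaction energy in the form V(x) = -A r0 / (x^2 + r0^2), differentiates it analytically, and scans for the displacement at which the restoring force dV/dx is greatest. It checks only the location r0/sqrt(3) and the peak value 3 sqrt(3) A / (8 r0^2) that follows from this V(x); it makes no attempt to reproduce the conversion to a tensile yield stress. Function names and the scan parameters are assumptions of the example.

```python
import math


def interaction_energy(x, A, r0):
    """V(x) = -A r0 / (x^2 + r0^2) for a displacement x from the solute line."""
    return -A * r0 / (x * x + r0 * r0)


def restoring_force(x, A, r0):
    """dV/dx = 2 A r0 x / (x^2 + r0^2)^2."""
    return 2.0 * A * r0 * x / (x * x + r0 * r0) ** 2


def locate_force_maximum(A, r0, samples=30000, span=3.0):
    """Scan 0 < x <= span*r0 on a uniform grid and return (x, force) at the peak."""
    best_x, best_f = 0.0, 0.0
    for i in range(1, samples + 1):
        x = span * r0 * i / samples
        f = restoring_force(x, A, r0)
        if f > best_f:
            best_x, best_f = x, f
    return best_x, best_f


A_IRON = 3.0e-20  # dyne cm^2, value quoted in the text
R0 = 2.0e-8       # cm, value quoted in the text

x_peak, f_peak = locate_force_maximum(A_IRON, R0)

if __name__ == "__main__":
    print(f"x at maximum force / r0 = {x_peak / R0:.4f}")
    print(f"1 / sqrt(3)             = {1.0 / math.sqrt(3.0):.4f}")
    print(f"peak force              = {f_peak:.3e} (c.g.s.)")
```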

A rough estimate of the stress needed to pull a dislocation away 
from a Maxwellian distribution can be made as follows. The quantity 
$A/kT$ $(= l)$ has the dimension of length and, since the density of the 
atmosphere is $\exp[-(l/r)\sin\theta]$, can be regarded crudely as the "radius" 
of the atmosphere. We expect that the force on the dislocation from its 
atmosphere is a maximum when the dislocation is displaced by a 
distance of about l from the centre. All the solute atoms in an area of 
order $l^2$ are then pulling it back. The force from one of these is $A/r^2$, and 
we shall take an average value $l/2$ for r. The number of solute atoms in 
the area is of order $l^2 c_0 N \exp(l/r) \approx 7 l^2 c_0 N$, per unit length of 
dislocation. The critical stress for the breakaway is then 

$$\sigma \approx (1/b)(4A/l^2)(7 l^2 c_0 N) \approx 28 A c_0 N/b. \tag{40}$$

A more accurate calculation 92 leads to the same result as this, but with 
a numerical factor of 17 in place of 28. Since N is of order $1/b^3$, or 
$1/b r_0^2$, this critical stress is essentially the same as that of equation (39), 
but multiplied by a factor of order $10 c_0$. The presence of $c_0$ means that 
in dilute solutions the yield stress for a Maxwellian atmosphere is far 
smaller than that for a condensed one. It is the absence of $c_0$ in equation 
(39) that makes the yield point for a condensed atmosphere so strong 
even in very dilute solutions. We suppose that the rather exceptional 
behaviour of iron has its origin here; that room temperature is well 
below $T_0$ in this case but above $T_0$ for those metals which do not show 
the yield phenomenon. 

A feature of the theory is that the yield point should depend strongly 
on temperature. There are three effects : 

(a) As the temperature is increased through the range of $T_0$ the yield 
stress changes from the value given in equation (39) to that in 
equation (40). 

(b) Because the atmosphere, and particularly the condensed one, is 
highly localized round the dislocation, only a small displacement of a 
small length of the dislocation line is needed to create a nucleus of 
freed dislocation sufficient to enable the applied stress to drag away 
the rest of the dislocation; the energy needed for such a nucleus is 
small enough for thermal fluctuations to make an important contribution 
to it. 88 

(c) LOMER 93 has pointed out that all the available sites in a condensed 
atmosphere are not filled because the configurational entropy is increased 
if a few remain empty. These empty sites, which weaken the 
anchoring of the dislocation by the atmosphere, increase in number as the 
temperature is raised. 

Fig. 21. Yielding in a single crystal of $\beta$-brass. (1) first test; 
(2) immediately reloaded; (3) after strain ageing for two 
hours at 250°C (after ARDLEY) 

Fig. 21 shows an example of the yield phenomenon obtained in a 
single crystal of $\beta$-brass grown in nitrogen. 94 It shows the characteristic 
sharp fall from the upper to the lower yield point, the absence of this 
on immediate retesting when the material is over-strained, and the 
return of it on strain ageing. Several examples of materials showing the 
yield phenomenon are now known. Soft polycrystalline iron or mild 
steel, containing a little carbon and nitrogen, is the familiar one. Yield 
points have now been obtained in single crystals of iron containing 
carbon or nitrogen, in some cases during the first test made on them, 95, 96 
but in others only after strain-ageing treatments have been given. 22, 97 
Examples amongst non-ferrous metals are polycrystalline 
molybdenum, 98 and single crystals of zinc, 99, 100 cadmium 101, 102, 103 and 
$\alpha$- and $\beta$-brass. 94 It is often not possible to demonstrate the existence of 
a yield point in a soft single crystal by means of the tensile test, because 
of difficulties in avoiding stress concentrations, but it can be proved by 
means of a bend test. 104 

The fact that, in iron, larger yield points are obtained in poly- 
crystals than in single crystals, and in fine grained polycrystals than in 
coarse grained ones, has led some authorities to suggest that the yield 
point in iron is caused by films of hard material, probably rich in 
carbon, that encase the grains and prevent the soft metal inside them 
from flowing until, at the upper yield point, they break and release the 
plastic flow. 105 The difficulty with this view is to see how it can lead to 
an appreciable fall of stress at the yield point, and why in practice 
there is such a small amount of plastic deformation before the upper 
yield point is reached. 106, 107 

A qualitative explanation can be based on the dislocation theory, 107 
as follows. There are three important stresses to be considered: (a) the 
observed upper yield stress $\sigma_c$, (b) the theoretical breakaway stress $\sigma_0$ 
(at the temperature of the experiment) for an anchored dislocation, 
(c) the applied stress $\sigma_0/q$ at which dislocations begin to break away in 
regions of severe stress concentration q. Several experimental facts can 
be correlated 107 on the basis that $\sigma_0/q < \sigma_c < \sigma_0$. Yielding begins at a 
few places of high stress concentration, before the upper yield point is 
reached, and creates small regions in which plastic flow has begun while 
the rest of the specimen still remains elastic. The spread of these 
nuclei of yield is resisted by obstacles in the path of the freed dislocations 
in them, the most important obstacles being grain boundaries. The 
upper yield is the stress at which one of these nuclei first becomes able 
to spread into the rest of the material. 

Starting from equation (35) for the interaction energy, a calculation 
can be made of the rate at which solute atoms segregate to a free 
dislocation during strain ageing. The expression obtained by COTTRELL 
and BiLBY 88 for this is 

n(t) - *n (ADt/kTf*, .... (41) 



where $n(t)$ is the number of atoms segregated in time t to unit length
of a dislocation in a region where the initial number of atoms in solution
is $n_0$ per unit volume ; D is the diffusion coefficient for these atoms and
$\alpha$ is a number nearly equal to 3. The physical basis of this formula
can be understood as follows. The gradient of the interaction energy V
provides a force attracting a solute atom to a dislocation and, migrating
by thermal agitation, the atom acquires a drift velocity v proportional
to it. Since V varies as $r^{-1}$, where r is the distance from the dislocation,
the gradient of V and the velocity v vary as $r^{-2}$. Thus atoms starting
at a distance r from the dislocation take a time t proportional to $r/v$,
i.e. to $r^3$, to reach it. The number that reach the dislocation in a time
less than this is proportional to the initial number at all distances less
than r, i.e. is proportional to $r^2$, and hence to $t^{2/3}$.
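
The scaling argument above can be checked numerically. The following is a minimal sketch, not part of the original treatment: the function names and the constant K (standing for A D/kT in arbitrary units) are assumptions made for illustration. It takes a drift speed K/r^2, integrates it to obtain the arrival time from a distance r, and counts the atoms initially inside that radius. This crude count reproduces only the exponent 2/3 of equation (41), not its numerical factor.

```python
import math

def arrival_time(r, K=1.0):
    """Time for an atom starting at distance r to reach the dislocation
    when its inward drift speed is v = K / r**2.
    K is an assumed constant standing for A*D/kT in arbitrary units."""
    # dr/dt = -K / r**2 integrates to t = r**3 / (3 K)
    return r**3 / (3.0 * K)

def atoms_arrived(t, n0=1.0, K=1.0):
    """Atoms per unit length of dislocation arriving within time t:
    all those initially inside the radius r(t) = (3 K t)**(1/3)."""
    r = (3.0 * K * t) ** (1.0 / 3.0)
    return n0 * math.pi * r**2

def time_exponent(t1, t2):
    """Logarithmic slope of atoms_arrived between two times."""
    return math.log(atoms_arrived(t2) / atoms_arrived(t1)) / math.log(t2 / t1)

exponent = time_exponent(1.0, 8.0)
print(exponent)  # close to 2/3
```

Doubling K in this sketch shortens every arrival time by a factor of two, which is the sense in which a faster diffusion coefficient speeds up the ageing.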

Fig. 22. The effect of temperature on the rate of strain ageing
in iron containing carbon (after HARPER)

The actual rate of strain ageing should become slower during the
later stages than equation (41) indicates because the dislocations
become saturated with solute atoms. HARPER 108 made a simple generalization
to include such effects by supposing that the rate of segregation
decreases in proportion with the amount segregated. This gives for f,
the fraction segregated,

$$f = 1 - \exp[-\alpha\rho(ADt/kT)^{2/3}], \tag{42}$$

where $\rho$ is the density of dislocations. Fig. 22, taken from HARPER'S
internal friction measurements of the strain-ageing process, shows that,
for the segregation of carbon in cold worked iron, $\log(1 - f)$ varies
linearly with $t^{2/3}$ as the theory requires. From the slopes of these lines
the activation energy is found to be 20,000 cal/mol, in agreement with
the value for diffusion of carbon in ferrite. By fitting equation (42) to
the experimental results the density of dislocations can be found. The
values HARPER obtained, e.g. $2.5 \times 10^{11}$ cm$^{-2}$ after 10 per cent strain,
agree with those of COTTRELL and CHURCHMAN, 109 who followed the
strain-ageing process by an electrical resistance method.
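
The way an activation energy is extracted from plots of this kind can be sketched as follows. This is an illustration on synthetic data, not a reconstruction of HARPER'S analysis: the function names, the lumped rate constant, the constant C and the two temperatures are all assumptions. The sketch takes f = 1 - exp(-rate t^(2/3)), fits the slope of -ln(1 - f) against t^(2/3) at two temperatures, and recovers Q on the further assumption that D = D0 exp(-Q/RT), so that the rate varies as (D/T)^(2/3).

```python
import math

R_CAL = 1.987  # gas constant, cal/(mol K)

def harper_fraction(t, rate):
    """Fraction segregated after time t, f = 1 - exp(-rate * t**(2/3)).
    `rate` lumps together alpha * rho * (A*D/kT)**(2/3)."""
    return 1.0 - math.exp(-rate * t ** (2.0 / 3.0))

def fit_rate(times, fractions):
    """Least-squares slope through the origin of -ln(1 - f) against t**(2/3)."""
    xs = [t ** (2.0 / 3.0) for t in times]
    ys = [-math.log(1.0 - f) for f in fractions]
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)

def activation_energy(rate1, T1, rate2, T2):
    """Q in cal/mol from fitted rates at two temperatures, assuming
    rate is proportional to (D/T)**(2/3) with D = D0 * exp(-Q/RT)."""
    # (3/2) ln(rate) + ln(T) = const - Q / (R T)
    y1 = 1.5 * math.log(rate1) + math.log(T1)
    y2 = 1.5 * math.log(rate2) + math.log(T2)
    return -R_CAL * (y2 - y1) / (1.0 / T2 - 1.0 / T1)

# Synthetic "measurements" generated with an assumed Q of 20,000 cal/mol.
Q_ASSUMED = 20000.0

def synthetic_rate(T, C=1.0e9):
    # C is an arbitrary constant chosen only to give convenient fractions.
    return C * (math.exp(-Q_ASSUMED / (R_CAL * T)) / T) ** (2.0 / 3.0)

times = [10.0, 30.0, 100.0, 300.0, 1000.0]
fitted = {}
for T in (300.0, 320.0):
    fractions = [harper_fraction(t, synthetic_rate(T)) for t in times]
    fitted[T] = fit_rate(times, fractions)

q_estimate = activation_energy(fitted[300.0], 300.0, fitted[320.0], 320.0)
print(q_estimate)  # recovers a value close to Q_ASSUMED
```

With real data the fitted rate at a known temperature would also give the product of $\alpha$, the dislocation density and $(AD/kT)^{2/3}$, which is how a dislocation density is obtained once A and D are known.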

These experiments show that the amount of carbon or nitrogen
transferred to dislocations during strain ageing in moderately worked
iron is of the order of $10^{-3}$ to $10^{-2}$ weight per cent. But the equilibrium
solubility of these elements at room temperature is much smaller than
this, of the order of $10^{-7}$ weight per cent for carbon and $10^{-5}$ weight per
cent for nitrogen. 89, 110 This means that either the solution is supersaturated
when the strain-ageing experiments are made or that some
particles of carbide and nitride redissolve in order to feed the dislocations.
Probably both of these occur in practice. It is known 89, 110 that
precipitation in unworked iron becomes extremely slow when the
amount left in solution is no more than $10^{-3}$ weight per cent, and that
the metal remains supersaturated to this extent even after one month at
room temperature. This is probably a result of the higher solution
pressure of finely dispersed precipitates. A recent study 111 of these
effects has shown that heat treatments designed to make it difficult for
dislocations to obtain carbon and nitrogen (e.g. prolonged ageing before
straining in order to make the precipitates coarse and reduce the supersaturation)
can slow down the rate of strain ageing by an order of
magnitude. It was found that the initial rate was closely proportional
to the concentration of carbon and nitrogen in solution, as equation (41)
requires.

WORK HARDENING 

To construct a complete theory of work hardening three steps are 
necessary : 

1. The adoption of some model to represent the structure of the cold 
worked crystal. 

2. The calculation of the stress needed to enforce slip in such a 
structure. 

3. The calculation of the strain needed to produce the structure.

All three have to be taken if the stress-strain curve is to be calculated. The
second step is the only one that is even moderately easy. Most theories 
have failed at the first one because the model adopted has not agreed 
with observed cold-worked structures. So far as the writer is aware, 
no one has ever accomplished the third step without the aid of an ad hoc 
assumption. Early theories, some of which were reviewed in T.D.I, 
achieved comprehensiveness by the adoption of a highly artificial 
model, but later ones have usually preferred to explain less in order to 
keep the model as realistic as possible. 

The simple picture of slip is one of translation of thin parallel sheets 
relative to one another along a fixed direction. On such a picture it 
has always been a puzzle to understand why these sheets should not 
slip right off one another once they have started to slide, i.e. why
there should be any work hardening. The recent tendency is to uphold 
the primitive notion from which this difficulty springs ; to believe that 
in this simple "laminar" slip there is in fact little or no work hardening 
and that the intense work hardening so often observed in practice is 
caused by more complex or "turbulent" slip processes involving 
inhomogeneous lattice rotations and slip on intersecting planes. 

The main evidence for this view is the striking difference in the work
hardening of crystals of hexagonal and cubic metals. Many shear
stress-strain curves of crystals deformed in tension can be approximated
in the plastic range by the relation

$$\sigma = h\gamma^{1/m}, \tag{43}$$

where $\sigma$ is the increase in shear stress, above the initial yield value,
needed to produce the shear strain $\gamma$, and where $m \approx 1$ for hexagonal
metals and 2 for cubic ones. The parameter h is the coefficient of work
hardening; at room temperature it is about $4 \times 10^{8}$ dyne cm$^{-2}$ for
aluminium, $5 \times 10^{7}$ for magnesium, and $2.5 \times 10^{7}$ for zinc. The
difference between metals of cubic and hexagonal structures does not
stop at this. With zinc and cadmium crystals the slip bands all appear
early in the deformation, whereas with aluminium they increase in
proportion with the applied stress. In a detailed study of tensile deformation
HONEYCOMBE 112 showed that x-ray Laue asterisms are observed
from aluminium crystals (except those oriented to deform in multiple
slip) but not from cadmium ones. In these aluminium crystals neither
the work hardening nor the asterisms could be eliminated except by
annealing at temperatures where recrystallization occurred. The
cadmium crystals, however, would not recrystallize and the small
amount of work hardening that could be introduced in them by tensile
straining was quickly removed by mild annealing.
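
The size of the contrast between the two classes of metal can be illustrated with the quoted coefficients. The sketch below assumes that equation (43) has the form sigma = h gamma^(1/m), which is the reading consistent with h being quoted in units of stress for both classes; the function names and the choice of 10 per cent shear strain are illustrative only.

```python
def shear_stress_increase(gamma, h, m):
    """Increase in shear stress above the initial yield value at shear
    strain gamma, using the assumed form sigma = h * gamma**(1/m)."""
    return h * gamma ** (1.0 / m)

# (h in dyne/cm^2, m): room-temperature coefficients quoted in the text.
CRYSTALS = {
    "aluminium": (4.0e8, 2),  # cubic
    "magnesium": (5.0e7, 1),  # hexagonal
    "zinc": (2.5e7, 1),       # hexagonal
}

def hardening_table(gamma):
    """Stress increase for each metal at the given shear strain."""
    return {name: shear_stress_increase(gamma, h, m)
            for name, (h, m) in CRYSTALS.items()}

table = hardening_table(0.1)
for name, sigma in table.items():
    print(f"{name:10s} {sigma:.3e} dyne/cm^2")
```

On these figures the aluminium crystal has hardened by more than ten times as much as either hexagonal crystal at the same strain.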

Some of this evidence has been available for many years, but not 
until recently has it seemed feasible to draw the above conclusion from 
it. For it immediately leads to the following difficulty. Admittedly, a 
cubic crystal oriented to deform from the start by double slip may well 
behave differently from one of cadmium where double slip is not 
observed ; but it is easy to obtain orientations where cubic crystals 
should deform at the start by single slip, and why do these behave 
differently from the hexagonal ones? TAYLOR 113 (see also references 
35 and 114) showed that strain hardening in aluminium is more rapid 
when the crystal deforms from the start in double slip than in single 
slip, but this difference was only about 20 per cent and quite trivial 
when compared with the difference between cubic and hexagonal 
crystals. 

Again, the recent tendency is to uphold the notion upon which the
difficulty rests ; to say that when a cubic crystal is made to deform in
pure single slip it does so like a hexagonal one and shows little hardening. 
However, just as it is difficult to prevent fast laminar flow in a fluid 
from breaking up into turbulent flow, so it is difficult, with a cubic 
crystal, to prevent laminar slip from breaking up into turbulent slip. 
It is not sufficient merely to choose an initial orientation suitable for 
single slip. Other steps are also necessary and if these are not taken the 
crystal generally does not behave in the simple way expected of it. 

Fig. 23. Contrast between behaviour of a silver crystal (1) with a clean
surface but exposed to air for 2 hours at room temperature, and (2) a
similar crystal tested in argon. Both tested at room temperature (after
ANDRADE and HENDERSON)

The main evidence for this view is the effect known as "easy glide" ; 
under certain conditions the plastic part of the stress-strain curve of a 
face centred cubic crystal begins with a region where the hardening 
coefficient is abnormally small, or even zero, after which the curve rises 
steeply to give the intense work hardening typical of such crystals. The 
observations of this are as follows : 

(a) ANDRADE and HENDERSON 115 observed such regions in crystals of 
pure gold and silver tested in tension, and concluded that easy glide is 
favoured by high purity, complete absence of surface contamination 
(e.g. oxide films), and a low temperature of deformation. Fig. 23 shows 
the remarkable effect that exposure to air has upon the work hardening 
of silver crystals. 

(b) MADDIN, MATHEWSON and HIBBARD 35 showed that some single
crystals of α-brass deform on a single set of slip planes with practically
no hardening until the stage is reached where double slip begins, when
hardening becomes intense. Vacant atomic sites should be produced in
large numbers, by the crossing of screw dislocations, once double slip
begins and there is some indirect evidence for this in the observations of
MASIMA and SACHS, 116 who showed that during single slip the change in
electrical resistance and density was small but became substantial when
double slip began.

(c) Crystals of aluminium stretched in tension show easy glide provided
they are pure and have orientations which avoid double slip. 117, 118
Fig. 24, taken from the work of LÜCKE and LANGE, 118 shows stress-strain
curves on two crystals of pure aluminium (99.99 per cent), one of which
(A) deformed initially by single slip and showed a region of easy glide
in which hardening was very small, while the other (B) deformed by
multiple slip and began hardening immediately. MASING and RAFFELSIEPER 117
showed that the region of easy glide became smaller the
nearer the initial orientation was to one giving multiple slip, and that
it was suppressed entirely in crystals of less pure aluminium (99.5 per
cent) ; these gave roughly parabolic curves and intense work hardening
for all initial orientations.

Fig. 24. Stress-strain curves on pure aluminium crystals
(after LÜCKE and LANGE)

(d) In an attempt to avoid plastic bending at the grips of a specimen,
RÖHM and KOCHENDÖRFER 119 strained long crystals of pure aluminium
(99.99 per cent) in a special shearing apparatus. They found that this
produced simple laminar slip which resembled slip in hexagonal metals
in that it gave a linear stress-strain relation, a small work hardening
which could be removed easily by mild annealing, and no x-ray
asterisms.

(e) PRATT 120 studied the deformation of sodium chloride crystals by
means of special optical methods. Two kinds of stress-strain curves 
were obtained, steeply rising parabolic ones on crystals that deformed 
throughout on intersecting slip systems, and less steep and roughly 
linear ones on crystals in which, after a small initial period of double 
slip, only one slip system was active. 

The impression gained from these results is that a cubic crystal 
oriented for single slip will start to deform in this ideal manner and give 
little work hardening, as would be expected. However, sooner or later, 
something happens that breaks up the flow into a more turbulent kind 
that gives intense work hardening. By taking certain precautions one 
may delay the onset of this second stage, or even postpone it, but if these 
are not taken it begins so soon that the "easy glide" characteristic of 
ideal single slip is not observed. In some cases the onset of turbulent 
flow is delayed until double slip clearly begins (α-brass) but this is not
true of aluminium stretched in tension, where it usually begins within 
the first few per cent extension ; one cannot be sure, of course, that 
small amounts of slip on "unpredicted" planes do not occur during this 
stage. However, the structural feature which appears to be connected 
with the change in the nature of the flow in this case is the deformation 
band or kink band. HONEYCOMBE 112 showed that these begin to appear
after 1 per cent extension in crystals that deform by single slip, but not 
in those that deform by double slip. 

Detailed studies of deformation bands have been made by CAHN, 37
HONEYCOMBE, 112 CHEN and MATHEWSON, 121 and LALOEUF and CRUSSARD. 122
Fig. 25 is an example of a band. It consists of a thin sheet
extending across the crystal. At small strains the sheet is plane and
normal to the active slip direction. It forms an obstacle to slip and
many slip bands fail to penetrate it, the piled up dislocations in them
producing the characteristic double bend of the lattice illustrated in
Fig. 26. The intense stresses from these cause slip to occur on other slip
systems in the band, and these intersecting slip bands become clearly
visible after strains of about 20 per cent. The spacing between deformation
bands is commonly about $10^{-2}$ to $10^{-1}$ cm and the change in
orientation across a band is sufficient to account for the x-ray Laue
asterisms observed from crystals containing bands. Crystals which
deform entirely in multiple slip develop the bands either not at all or
on too fine a scale to be seen.

The process by which the bands form is still obscure. The two main
views are that they start from bend planes which run into the crystal
from the ends, having been formed by the lattice bending caused by
grip constraints, 37, 112, 120, 121 or that they grow by the successive
trapping of dislocations moving in slip planes near to one containing
an obstacle against which dislocations have piled up. 23, 84 Deformation
bands bear a geometrical resemblance to the kinks produced in hexagonal
metal crystals by compression along an axis in the basal plane.
FRANK 123 has suggested that macroscopic kinks can grow from short ones
because the stress concentration at the end of an incomplete wall of edge
dislocations can rise sufficiently to create new dislocations in the perfect
lattice beyond it. A wall of edge dislocations which extends part of the
way through a crystal is equivalent to a wedge, of atomic sharpness at
its edge and composed of the half-planes of these dislocations, which is
driven into the crystal.

Fig. 26. Piling of dislocations against a deformation band (a) before
and (b) after polygonization

The above discussion points to the conclusion that strong work 
hardening in cubic metals originates in the interactions among dis- 
locations on intersecting slip bands. The arguments for this are as 
follows : 

(a) This kind of hardening is only observed in crystals capable of 
slipping on intersecting planes. 

(b) It is only observed in these crystals under circumstances that
favour slip on intersecting systems, i.e. when the orientation is suitable 
for multiple slip, or when deformation bands are present. 

(c) It does not seem possible to achieve the stability which the rela- 
tive permanence of the cold worked state requires by means of purely 
Hookean interactions amongst slip dislocations in parallel planes. A 
dislocation in an array such as those considered by TAYLOR and 
KOEHLER (T.D.I, page 105) is at rest only so long as the net shear stress 
acting on it is zero. It is difficult to design arrays for which this stress is 
precisely zero on all dislocations in both the presence and absence of 
applied stress, and most arrays that one might expect to form turn out 
on analysis to be mechanically unstable unless some of their dislocations 
are permanently fixed in position and not merely allowed to rest because
no shear stress acts on them. It is not necessary that all dislocations
should be anchored, and the elastic after-effect described earlier shows
that some of them are in fact mobile. What is required is that some
dislocations in key positions should become incapable of moving and
then act as obstacles confining the other dislocations, which are still
mobile, to move within restricted areas of their slip planes.

Fig. 25. A deformation band in aluminium: crystal
extended 20 per cent (after CAHN) × 250

Two mechanisms involving slip on intersecting planes which would 
provide such locking have been suggested. The first is that of LOMER 
and COTTRELL, described earlier, in which slip dislocations are turned 
into sessile ones by combination with slip dislocations on intersecting 
planes. The other mechanism, also described earlier, is that which 
leads to the formation of jogs. A dislocation may be able to glide freely 
in one slip plane but nevertheless will be an obstacle to dislocations 
moving on intersecting planes if, to be carried along with these, it has to 
glide out of its plane. A row of such dislocations, as shown in Fig. 18, 
holding up slip in an active plane provides a suitable obstacle for work 
hardening. Furthermore, as we have noted earlier, the hardening 
caused by such obstacles is sensitive to temperature, and this is observed 
in practice. The density of intersecting dislocations needed to account 
for the observed hardening in heavily strained crystals is in the range 
$10^{10}$ to $10^{12}$ cm$^{-2}$, and there is some independent evidence for such
densities (T.D.I, page 104, also references 108 and 109). 

Hardening due to the mutual interference of dislocations in different 
slip planes should, of course, be most severe in crystals oriented for 
multiple slip. However, it should not be thought that this kind of 
hardening is absent in a crystal which shows only one set of visible slip 
bands. What matters is the density of dislocations in intersecting slip 
planes and this is not the same as the total amount of slip on these planes. 
This point of view is brought out in a theory which is presented below. 
Even with an orientation suited for double slip there is a tendency to 
develop a patchwork grouping of regions in which slip is predominantly 
on one plane, as HONEYCOMBE 112 showed with his "bands of secondary
slip." Such a clustering of parallel slip planes into packets is to be
expected since the array of similar dislocations built up in a cluster is 
difficult to penetrate by dislocations approaching along a different 
family of slip planes. 

The ideas outlined above provide the elements of a theory of hardening
for turbulent slip, but not for laminar slip, the theory of which has
still to be developed. Various suggestions have been put forward by
KUHLMANN, 124 SEITZ, 83 OROWAN, 21 and MOTT 23, 84 which may prove
important for this. An attempt to develop the theory of turbulent
hardening quantitatively has been made by MOTT, 23 whose theory will
now be described.

The theory is two-dimensional and based on the piling up of edge
dislocations against obstacles spaced at intervals of 2L along their slip
planes. At each obstacle a group of n dislocations of the same sign is
piled up and anchored by combination with a few dislocations on intersecting
planes which have been forced into activity to relieve the high
stresses round the group. Any slip on secondary planes other than these
small and highly localized movements is not essential to the theory.
Each group acts at large distances like a "super-dislocation" of strength
nb, and its stress field reaches n times as far, at the same intensity, as
that of a unit dislocation. This is held to account for the wide spacing
between the slip bands, such as is observed in aluminium. 26 The picture
of work hardening which emerges is similar to that of TAYLOR (T.D.I)
but with the following main differences : (a) the dislocations are
anchored, (b) the strength of each dislocation is n times larger, (c) the
spacing of them is correspondingly larger, (d) the dislocations are not
necessarily arranged in any regular array. Let the distance between the
active slip planes be d. Then the density of the "super-dislocations" is
$(Ld)^{-1}$, and the mean distance between them is $(Ld)^{1/2}$. Each exists in a
field of internal stress due to its neighbours given approximately by

$$\sigma_i = \frac{\mu n b}{2\pi (Ld)^{1/2}}. \tag{44}$$

The plastic strain is $\rho(nb)L$, or, substituting $\rho = (Ld)^{-1}$,

$$\gamma = nb/d. \tag{45}$$

Eliminating d from these relations gives a parabolic relation between
$\sigma_i$ and $\gamma$,

$$\sigma_i = (\mu/2\pi)(n\gamma b/L)^{1/2}, \tag{46}$$

similar to that of TAYLOR'S theory (T.D.I, equation (23)).
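
The elimination of d can be verified numerically. The sketch below is illustrative only and rests on stated assumptions: the internal stress is taken as mu n b / 2 pi (Ld)^(1/2), the strain as nb/d, and the parabolic relation as (mu/2 pi)(n gamma b / L)^(1/2); the shear modulus, Burgers vector and geometry are round numbers chosen for the example, not values from the text.

```python
import math

def internal_stress(mu, n, b, L, d):
    """Stress on a group of n dislocations from neighbours at a mean
    distance (L d)**0.5, in the assumed form of the internal-stress relation."""
    return mu * n * b / (2.0 * math.pi * math.sqrt(L * d))

def plastic_strain(n, b, d):
    """Shear strain gamma = n b / d."""
    return n * b / d

def parabolic_stress(mu, n, b, L, gamma):
    """Internal stress expressed through the strain after eliminating d."""
    return (mu / (2.0 * math.pi)) * math.sqrt(n * gamma * b / L)

# Assumed illustrative values in c.g.s. units.
MU = 2.7e11   # shear modulus, dyne/cm^2
B = 2.86e-8   # Burgers vector, cm
N = 100       # dislocations per piled-up group
L = 1.0e-3    # half-spacing of obstacles, cm
D = 1.0e-4    # spacing of active slip planes, cm

gamma = plastic_strain(N, B, D)
direct = internal_stress(MU, N, B, L, D)
eliminated = parabolic_stress(MU, N, B, L, gamma)
print(gamma, direct, eliminated)
```

The two stresses agree for any choice of d, which confirms that the parabolic form is simply the internal stress rewritten in terms of the strain.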

To eliminate the somewhat arbitrary parameter L in this relation,
MOTT introduces his idea of the dynamic generation of loops at a
Frank-Read source. The first time a source of length l is made to work,
a stress $\Delta\sigma \approx \mu b/l$ is needed, in addition to the frictional stresses due
to impurities etc., but thereafter the source works spontaneously until
the back-stress on it has cancelled this stress. The number n of dislocations
needed to do this from a distance L is given by $\Delta\sigma \approx n\mu b/2\pi L$,
which implies, since $\Delta\sigma \approx \mu b/l$, that $n/L \approx 2\pi/l$. Substituting this in
the stress-strain relation gives

$$\sigma_i = \mu(\gamma b/2\pi l)^{1/2}. \tag{47}$$

MOTT emphasizes three things about this result : 
(a) When l is constant it gives a parabolic relation between stress and
strain, as is observed.

(b) When $l \approx 10^{-4}$ cm, which is reasonable, the coefficient of work
hardening is of the right order of magnitude.

(c) The stress-strain relation does not depend on L, the spacing of 
the obstacles. This is important since the intensity of turbulent work 
hardening is roughly the same in crystals of different orientations. 
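
The three points can be illustrated with a short calculation. This is a sketch under stated assumptions: equation (47) is read as sigma = mu (gamma b / 2 pi l)^(1/2), and the shear modulus and Burgers vector are round values for aluminium supplied here for illustration, not taken from the text.

```python
import math

def mott_stress(gamma, mu, b, l):
    """Stress from the assumed form sigma = mu * (gamma b / (2 pi l))**0.5."""
    return mu * math.sqrt(gamma * b / (2.0 * math.pi * l))

def stress_via_obstacles(gamma, mu, b, l, L):
    """The same stress computed from the parabolic relation in n and L,
    with n fixed by the back-stress condition n / L = 2 pi / l."""
    n = 2.0 * math.pi * L / l
    return (mu / (2.0 * math.pi)) * math.sqrt(n * gamma * b / L)

# Assumed illustrative values for aluminium, c.g.s. units.
MU = 2.7e11            # shear modulus, dyne/cm^2
B = 2.86e-8            # Burgers vector, cm
SOURCE_LENGTH = 1.0e-4  # source length l, cm, the value quoted in the text

# Coefficient of the parabola (stress at unit shear strain).
h_mott = mott_stress(1.0, MU, B, SOURCE_LENGTH)
print(f"{h_mott:.2e} dyne/cm^2")

# The result is the same whatever obstacle spacing L is assumed.
for L in (1.0e-4, 1.0e-3, 1.0e-2):
    print(L, stress_via_obstacles(0.1, MU, B, SOURCE_LENGTH, L))
```

With these inputs the coefficient comes out within an order of magnitude of the $4 \times 10^{8}$ dyne cm$^{-2}$ quoted earlier for aluminium, and the loop shows the independence of L directly.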

To account for the dependence of work hardening on temperature,
MOTT takes up an idea due to BROWN, 26 and suggests that vacancies
created in an active slip plane during plastic flow migrate into neighbouring
planes, up to 100 atoms away, and soften these sufficiently to
cause them to start slipping. This is also held to account for the stepped
structure of slip bands observed by HEIDENREICH and SHOCKLEY, 25 and
BROWN. 26 However, alternative explanations are possible. We have
seen that the hardening caused by intersecting dislocations is sensitive
to temperature. As regards the formation of steps in slip bands, the
suggestion has been put forward independently by FISHER 24 and the
writer that these are caused by the displacement of outlying parts of
active slip planes, relative to the central ones, as a result of slip on intersecting
planes. This process is illustrated in Fig. 27. Here a slip band
ABC emerges on the surface at C. At an earlier stage in the deformation
B'C' was the continuation of the line AB of the band, but this part has
now been moved up by slip on an intersecting plane DE. Since B'C'
is now cut off from the Frank-Read source generating the band no more
slip can take place on this step. This effect of the cessation of slip on
existing steps, when new ones are formed, has been observed by
BROWN, 125 who calls it "sterile slip."

Fig. 27. Formation of steps on a slip band

ANNEALING 

In T.D.I a theory was given which explained why polygonization occurs
in a plastically bent crystal, and why polygon boundaries are perpendicular
to the active slip direction, from the elastic properties of edge
dislocations. It did not explain how polygonization occurs, however.
Two points are important here : (a) in the freshly bent crystal the
dislocations are grouped "horizontally" in slip planes, whereas after
polygonization they are grouped "vertically" in walls ; to produce this
change they must climb out of their slip planes besides gliding along
them ; (b) polygonization only occurs rapidly at temperatures where
self-diffusion is also rapid. As MOTT 84 has pointed out, this means that
polygonization must occur by the addition or removal of matter at jogs
on the edges of the half-planes by atomic migration. The rate at which
a jog gives off a vacancy can be written as $c \exp(-W/kT)$, where W is
the activation energy for self-diffusion and where the frequency factor
c contains a term proportional to the force pushing the edge dislocation
out of its slip plane. 84 If $n/b$ is the number of jogs per unit length,
where b is the atomic spacing along the edge, the velocity with which the
dislocation climbs away from its slip plane is

$$nbc \exp(-W/kT). \tag{48}$$

As it climbs, the dislocation has to be replenished with jogs since 
the existing ones run to the end of the line. New jogs can be made by 
the action of thermal fluctuations, in which case the equilibrium density 
of jogs along the edge is given by the relation $n = \exp(-U/kT)$,
where U is the work of formation of a jog; the activation energy for 
climbing is then increased to U + W, and polygonization cannot occur 
rapidly except at high temperatures. Alternatively, the dislocation may 
glide slowly under an applied stress in its slip direction, while it is 
climbing and so generate its own jogs by cutting through intersecting 
screw dislocations. In this case the activation energy for climbing is 
only W, not U + W, and polygonization occurs at lower temperatures. 
This is perhaps why cell-structures, very reminiscent of complex
polygonization boundaries, are formed during slow creep in polycrystals
at temperatures much lower (e.g. 250°C instead of 600°C) than
are necessary to produce them by annealing after cold work. 126 The
fact that these cell-structures have crystallographically irregular 
boundaries is not an argument against their being formed by poly- 
gonization, since the mutual constraints of neighbouring crystals force 
slip to occur by dislocations with more than one kind of Burgers vector ; 
SHOCKLEY and READ 55 have suggested, and LOMER and NYE 65 have 
confirmed with the bubble raft, that the equilibrium orientation of a 
small-angle boundary is always perpendicular to the combined Burgers 
vector of the dislocations in it. 
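
The difference between the two ways of supplying jogs can be made concrete with a short calculation. The sketch below is illustrative: the energies W and U, the frequency factor c, the spacing b and the fixed jog fraction are hypothetical values chosen for the example, and c is treated as a constant although in the text it also contains the driving force.

```python
import math

K_B = 8.617e-5  # Boltzmann constant, eV/K

def climb_velocity(T, n, b, c, W):
    """Climb velocity n b c exp(-W/kT), as in equation (48)."""
    return n * b * c * math.exp(-W / (K_B * T))

def thermal_jog_fraction(T, U):
    """Equilibrium jog fraction n = exp(-U/kT) from thermal fluctuations."""
    return math.exp(-U / (K_B * T))

def apparent_activation_energy(velocity, T1, T2):
    """Activation energy inferred from the velocities at two temperatures."""
    return -K_B * math.log(velocity(T2) / velocity(T1)) / (1.0 / T2 - 1.0 / T1)

# Hypothetical illustrative parameters.
B_CM = 2.5e-8   # atomic spacing along the edge, cm
C_HZ = 1.0e13   # frequency factor, 1/s
W_EV = 2.0      # activation energy for self-diffusion, eV
U_EV = 0.5      # work of formation of a jog, eV

def v_thermal_jogs(T):
    """Climb when jogs must be created thermally."""
    return climb_velocity(T, thermal_jog_fraction(T, U_EV), B_CM, C_HZ, W_EV)

def v_glide_fed_jogs(T, n=1.0e-3):
    """Climb when gliding through screw dislocations keeps n fixed."""
    return climb_velocity(T, n, B_CM, C_HZ, W_EV)

q_thermal = apparent_activation_energy(v_thermal_jogs, 800.0, 900.0)
q_glide = apparent_activation_energy(v_glide_fed_jogs, 800.0, 900.0)
print(q_thermal, q_glide)  # close to U + W and to W respectively
```

The lower apparent activation energy in the second case is what allows climb, and hence polygonization, to proceed at lower temperatures during slow creep.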

Apart from this progress in the theory of polygonization, the development
of a structural theory of recovery has been held up while the
nature of the cold-working process is being clarified. In the meantime,
a formal theory of recovery has been developed by KUHLMANN SI 124
and COTTRELL and AYTEKIN, 127 in which the basic idea is that the rate,
$d\sigma/dt$, at which the yield stress $\sigma$ recovers on annealing is controlled
by an activation energy, $Q - \beta\sigma$, that depends linearly on the yield
stress. An analysis similar to that used to derive equation (30) then
shows that the yield stress decreases logarithmically with the annealing
time, a relation which has been confirmed experimentally. 127, 128, 129
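
The logarithmic law can be seen to follow from a stress-dependent activation energy by direct integration. The sketch below is an illustration under assumptions: the rate is taken as -c exp[-(Q - beta sigma)/kT], the symbols c and beta and all numerical values are hypothetical and in arbitrary units, and the closed form is the standard solution of that rate equation rather than a formula quoted from the text.

```python
import math

def recovery_rate(sigma, c, Q, beta, kT):
    """Assumed rate law d(sigma)/dt = -c * exp(-(Q - beta*sigma)/kT)."""
    return -c * math.exp(-(Q - beta * sigma) / kT)

def integrate_recovery(sigma0, t_end, steps, c, Q, beta, kT):
    """Fourth-order Runge-Kutta integration of the rate law."""
    dt = t_end / steps
    sigma = sigma0
    for _ in range(steps):
        k1 = recovery_rate(sigma, c, Q, beta, kT)
        k2 = recovery_rate(sigma + 0.5 * dt * k1, c, Q, beta, kT)
        k3 = recovery_rate(sigma + 0.5 * dt * k2, c, Q, beta, kT)
        k4 = recovery_rate(sigma + dt * k3, c, Q, beta, kT)
        sigma += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return sigma

def logarithmic_law(sigma0, t, c, Q, beta, kT):
    """Closed form sigma0 - (kT/beta) ln(1 + t/t0) of the same rate law."""
    t0 = (kT / (beta * c)) * math.exp((Q - beta * sigma0) / kT)
    return sigma0 - (kT / beta) * math.log(1.0 + t / t0)

# Hypothetical parameters in arbitrary, dimensionless units.
PARAMS = dict(c=1.0, Q=10.0, beta=1.0, kT=1.0)
SIGMA0 = 5.0

numerical = integrate_recovery(SIGMA0, 1000.0, 2000, **PARAMS)
analytic = logarithmic_law(SIGMA0, 1000.0, **PARAMS)
print(numerical, analytic)
```

At times long compared with t0 the yield stress falls by a fixed amount, (kT/beta) ln 10, for each tenfold increase in annealing time.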

Recrystallization and grain growth both occur by the migration of a 
grain boundary in a direction normal to itself. The driving force in the 
first process is the difference in energy between the cold worked and 
annealed grains which meet at the boundary; in the second one it is 
the surface tension of the boundary. The velocity with which a boundary 
moves depends, of course, on the driving force, as the theories of MOTT 130
and TURNBULL 131 show, but evidence has been accumulating recently
that the enormous range in the mobilities of different grain boundaries 
is due, not so much to different driving forces, but to differences in 
orientation. It is convenient here to distinguish four types of boundary : 

(1) The small-angle (e.g. < 1°) simple Burgers boundary (Fig. 13),
containing one family of parallel and uncontracted edge dislocations
(see page 227).

(2) As (1), but with larger change of angle (e.g. 5° < $\theta$ < 20°) and
contracted dislocations.

(3) The small-angle or medium-angle unsymmetrical boundary containing
dislocations with more than one kind of Burgers vector.

(4) The large-angle (e.g. ~ 30°) incoherent boundary across which
there is no continuity along lattice rows, and which may be imagined
in terms of a monomolecular layer of "liquid" between the crystals.

The high mobility of type (1) boundaries has been demonstrated in 
the bend plane experiments of PARKER and WASHBURN, 58 in the bubble 
raft experiments of LOMER and NYE, 65 and in the observations of 
GUINIER and TENNEVIN, 132 and DUNN and DANIELS, 133 on polygonization.
Type (2) boundaries, on the other hand, do not appear to be very
mobile, as shown by the fact that the growth of polygonal domains 
becomes much slower when the angle between them increases. 133 
LOMER and NYE 65 have argued that boundaries made from contracted 
dislocations should be immobile because the Peierls-Nabarro force 
(T.D.I, page 90) resisting the motion of a dislocation increases rapidly 
as the width of the dislocation is reduced. 66 

Boundaries of type (3) are often found in worked polycrystals 126, 134, 135
and there is much evidence for their immobility. TIEDEMA, MAY and
BURGERS, 136 and BECK, SPERRY and HU 137 showed that small-angle
boundaries between strained crystals and adjacent strain-free ones of
almost identical orientation had very low mobilities ; the presence of
small unconsumed grains in single crystals grown by the strain-anneal
method is due to this effect. Similarly, the domain structures observed
by KELLAR, HIRSCH and THORP, 134 and by HEIDENREICH, 135 in cold
worked polycrystalline aluminium do not coarsen on prolonged ageing
even though they form practically instantaneously during working and
are on a very fine scale (~ $2 \times 10^{-4}$ cm). The difficulty of moving
such boundaries lies in the facts that the glide motions of their dislocations
lie along different directions, and that the climbing motions of these
dislocations involve diffusion through the regions of "good crystal"
which lie between them. 55, 65

The mobility of large-angle boundaries is shown by the well-known
observation that, in recrystallization and grain growth, the new crystals
have radically different orientations from those they replace. It seems
that the development of annealing textures has its origin here. BECK,
SPERRY and HU, 137 for example, showed that the new crystals which
grew fastest into a worked crystal were rotated, relative to its orientation,
by 30° to 40°, about a [111] axis. A large-angle boundary should
move rapidly by the transference of atoms, singly, from one crystal to
the other, since each atom in such a boundary is next to a defect similar
to a vacant atomic site. The movement bears a resemblance to that of
diffusion in a grain boundary and, as LOMER and NYE 65 point out, this is
consistent with the observation of ACHTER and SMOLUCHOWSKI 138 that
grain boundary diffusion becomes much faster when the boundary
angle is increased beyond 20°.

Fig. 28. Increase in the size of a growing crystal
during recrystallization

This dependence of mobility on orientation can be used to explain
another effect in recrystallization. It is known that recrystallization
starts slowly, 139, 140 and that measurements of the linear growth of a
new grain give a curve of the kind shown in Fig. 28. Over a large range
the growth increases linearly with time and the rate only falls away
from this constant value when the new grains impinge on each other.
However, if the linear part is extrapolated back linearly to zero diameter
it meets the time axis at some positive value, and this has led
to the idea that recrystallization is preceded by an earlier stage when
nucleation of new crystals occurs. In the writer's opinion this extrapolation
is misleading ; there does not seem to be any certain evidence
that distinctly different processes occur during "nucleation" and growth.
A reasonable view is that of BURGERS and LOUWERSE, 141 and CAHN, 142
who suppose that the new grains are formed by the growth of initially
small pieces of the old ones. Now the boundary dividing a growing
piece from its surroundings is a small-angle one at first, and so its
velocity of movement will be smaller than at a later stage of growth,
where the growing crystal will have advanced into more remote regions
of the crystal, regions which have radically different orientations from
its own and which make large-angle boundaries with it.

On this view, the slow start of the recrystallization process is explained 
by a slowness in the velocity of boundary migration during the early 
stages of growth; thus the linear curve ought not to be extrapolated
straight back to the time axis in Fig. 28, but curved back into the origin,
as shown by the alternative extrapolation. This view also resolves a
difficulty in BURGERS' and CAHN'S theories, which is to understand why
recrystallization begins in the most intensely curved regions of the crystal, in spite
of the fact that they are the most heavily strained regions and the 
driving force for growth should be least favourable there. We suppose 
that growth starts from them because only there is the orientation so 
widely different from average that a grain growing out of one of them 
can obtain for itself a large-angle boundary ; unless it can do this it 
cannot grow at a sufficient speed. This also helps to explain why 
crystals deformed in laminar flow do not recrystallize, on the grounds 
that the orientation in them does not vary widely enough to allow 
large-angle boundaries to be made; similarly it explains why
recrystallization starts so readily from a scratch or similar sharp surface
indentation. As CAHN 142 has already pointed out, the early stages of
recrystallization should be recognizable as polygonization, and TIEDEMA 143 has
recently obtained some direct evidence for this.

CREEP 

The confirmation of some of the general predictions of the exhaustion 
theory of creep, described in T.D.I, has led to further investigations 
with the result that the theory has been both broadened in scope and 
simplified in treatment. We shall now sketch a version of it based on 
the following assumptions : 

(1) The flowing specimen can be regarded as an aggregate of elements;
in many but not all creep problems the elements are dislocation lines.

(2) Each element has an activation stress $\sigma_a$ (= yield stress of the
element $\sigma_y$ $-$ applied stress $\sigma$) which has to be applied before it flows.

(3) With this activation stress is associated an activation energy
$U(\sigma_a, \sigma)$ which is an increasing function of $\sigma_a$.

(4) When an element is thermally activated by a temporary acquisition
of the energy $U$, it makes a jump, i.e. contributes an increment $v$
to the overall plastic strain.

(5) $U(\sigma_a, \sigma)$ is the same function of $\sigma_a$ for all elements; in particular
we shall make the assumption

$$U(\sigma_a) = A\sigma_a, \qquad (49)$$

which is consistent with the activation energy function of equation (34).
This function is appropriate when the hardening responsible for $\sigma_a$ is
caused by intersecting dislocations; with other kinds of hardening other
functions (e.g. $U \propto \sigma_a^{3/2}$ for precipitation hardening; see T.D.I) may
be better.

(6) The increment of strain v is the same for all jumps. 

(7) The elements are independent of each other and cannot jump more 
than once. 

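Let $N(\sigma_a, t)\,d\sigma_a$ be the number of elements with activation stresses
between $\sigma_a$ and $\sigma_a + d\sigma_a$ at time $t$. Then the rate of creep at this time
is given by

$$\dot{\gamma} = v \int_0^\infty f(\sigma_a)\, N(\sigma_a, t)\, d\sigma_a, \qquad (50)$$

where

$$f(\sigma_a) = \nu \exp\{-U(\sigma_a)/kT\} \qquad (51)$$

is the frequency of jumping per element, and $\nu$ is a frequency factor.
Since each element jumps only once, the numbers of unused elements
are exhausted at a rate

$$\frac{\partial N(\sigma_a, t)}{\partial t} = -f(\sigma_a)\, N(\sigma_a, t), \qquad (52)$$

which integrates to give the time dependence of $N(\sigma_a, t)$ as

$$N(\sigma_a, t) = N(\sigma_a, 0) \exp\{-f(\sigma_a)t\}, \qquad (53)$$

where $N(\sigma_a, 0)$ is the initial distribution.

With a suitable assumption for $N(\sigma_a, 0)$, e.g.

$$N(\sigma_a, 0) = \begin{cases} N & (0 < \sigma_a \le \sigma_m) \\ 0 & (\sigma_m < \sigma_a) \end{cases} \qquad (54)$$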

and with the energy function of equation (49), the integral in equation
(50) for the creep rate is elementary. However we shall use another
method which is only approximate but has greater generality and is more
suggestive physically. With reasonable values for the physical
parameters involved, the function $\exp\{-f(\sigma_a)t\}$ is practically a step
function of $\sigma_a$ when $\nu t \gg 1$. For small values of $\sigma_a$ it is nearly zero, and
for large ones nearly unity. Only in a very small range of $\sigma_a$, centred
about the value for which $f(\sigma_a)t = 1$, does it change from the one
extreme to the other. Thus the exhaustion process can be regarded as
the advance of a sharp step or thermal front along the $\sigma_a$ axis. As the
step sweeps across each part of the $N(\sigma_a, 0)$ curve all the elements in
that part are activated and make their contribution to the creep. In a
time $t$ the step arrives at the activation stress $\sigma_t$ defined by

$$U(\sigma_t) = kT \log \nu t, \qquad (55)$$

or, if equation (49) is used,

$$\sigma_t = (kT/A) \log \nu t. \quad (\nu t > 1.) \qquad (56)$$

The creep strain at time $t$ is then

$$\gamma(t) = v \int_0^{\sigma_t} N(\sigma_a, 0)\, d\sigma_a, \qquad (57)$$

or, with $N(\sigma_a, 0) = N$,

$$\gamma(t) = (vNkT/A) \log \nu t. \quad (\nu t > 1.) \qquad (58)$$

At time $t$ the stress

$$\sigma_y = \sigma_t + \sigma \qquad (59)$$

is the "yield stress" of the material in the sense that, if the applied
stress could be raised to this value, flow could occur without the aid
of thermal fluctuations. By making sudden small increments of applied
stress during a creep experiment the presence and position of this
thermal front can be examined.
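
The thermal-front approximation can be checked numerically. The following Python sketch is an illustration only and is not part of the original treatment. It integrates the exhaustion law of equation (53) directly for a uniform initial distribution and compares the result with the logarithmic law of equation (58). The function names, the reduced units (stress in units of kT/A, time in units of 1/ν, strain in units of vNkT/A) and the cut-off `x_max` are assumptions made for the example.

```python
import math

def strain_exact(nu_t, x_max=40.0, steps=40000):
    """Strain (in units of v*N*kT/A) from direct integration of eq. (53).

    x = A*sigma_a/kT is the reduced activation stress; x_max is an assumed
    cut-off, chosen far beyond the position of the thermal front.
    """
    dx = x_max / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx                 # midpoint rule
        f_t = nu_t * math.exp(-x)          # f(sigma_a) * t with U = A*sigma_a
        total += -math.expm1(-f_t) * dx    # fraction of elements already used
    return total

def strain_front(nu_t):
    """Strain (same units) from the sharp-front result, eq. (58)."""
    if nu_t <= 1.0:
        raise ValueError("the front picture needs nu*t > 1")
    return math.log(nu_t)

if __name__ == "__main__":
    for nu_t in (1e3, 1e6, 1e9):
        print(nu_t, strain_exact(nu_t), strain_front(nu_t))
```

Under these assumptions the two strains differ only by a nearly constant offset (close to Euler's constant, 0.577, in the reduced units), so that the slope against log νt, which is the quantity predicted by equation (58), is unaffected by the finite width of the front.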

Two investigations have been made recently to check the exhaustion
theory. DAVIS and THOMPSON 144 used a precipitation-hardened
copper-silver alloy and showed that their creep curves obeyed a time law of the
kind $\gamma \propto (\log \nu t)^{2/3}$, which is consistent with the exhaustion theory if
the activation energy function $U \propto \sigma_a^{3/2}$ appropriate to a
precipitation-hardened alloy is used. By means of incremental loading tests they also
demonstrated the existence of a critical stress, i.e. the existence of the
thermal front, which could be reached by a suitable load increment
(e.g. an increment of 1-4 kg mm$^{-2}$ after creep for 24 hours at room
temperature under a constant applied stress of 6 to 8 kg mm$^{-2}$) and at which
"instantaneous" plastic flow was produced. This investigation also
showed that the magnitude of the activation energy expected from the
precipitation-hardening theory (see T.D.I) was of the right order, but
that the observed value (~ 1 sec$^{-1}$) of $\nu$ was far smaller than that
expected (~ 10$^8$ sec$^{-1}$) for the vibration of a dislocation against a
precipitate. This discrepancy has still not been satisfactorily explained.
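
The connection between the energy function and the time law can be made explicit. If the energy function is taken as a power law, U = Cσ_a^p, equation (55) gives the front position as σ_t = (kT log νt / C)^(1/p), and with a uniform initial distribution the strain is vNσ_t; p = 1 recovers the logarithmic law and p = 3/2 gives the (log νt)^(2/3) law. The Python sketch below is a minimal illustration of this step. The constant `C`, the exponent argument `p` and the function names are assumptions introduced for the example, not quantities taken from the experiments.

```python
import math

def front_stress(nu_t, kT, C, p):
    """Activation stress reached by the thermal front, from eq. (55),
    for the assumed power-law energy function U = C * sigma_a**p."""
    if nu_t <= 1.0:
        raise ValueError("the front picture needs nu*t > 1")
    return (kT * math.log(nu_t) / C) ** (1.0 / p)

def creep_strain(nu_t, kT, C, p, N, v):
    """Strain from eq. (57) with a uniform initial distribution N."""
    return v * N * front_stress(nu_t, kT, C, p)

if __name__ == "__main__":
    # Hypothetical reduced units: kT = C = N = v = 1.
    for nu_t in (1e2, 1e4, 1e8):
        print(nu_t,
              creep_strain(nu_t, 1.0, 1.0, 1.0, 1.0, 1.0),   # logarithmic law
              creep_strain(nu_t, 1.0, 1.0, 1.5, 1.0, 1.0))   # (log)^(2/3) law
```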

The other investigation was that of WYATT, 145, 146 who used copper
hardened by the working received during the creep process itself. At
low temperatures (i.e. < 300°K) his specimens obeyed the simple
logarithmic creep law of equation (58), and he proved that, under these
conditions, a mechanical equation of state

$$\dot{\gamma} = f(\sigma, \gamma, T) \qquad (60)$$

was obeyed. At constant temperature $T$, for example, this means that
the rate of strain $\dot{\gamma}$ at any given instant depends only on the current
values of the applied stress $\sigma$ and strain $\gamma$ at this instant, and not on
past values. Essentially the method of proof was to make sudden
increments of applied stress during the creep experiment and to show
that the portions of the creep curve which followed such increments
could always be fitted exactly on earlier portions of the curve by sliding
them backwards by a suitable amount. From the distance along the
$\log t$ axis that it was necessary to slide the curves back, WYATT
concluded that the activation stress increases linearly with $\log t$, which
implies a linear relation, equation (49), between activation energy and
activation stress. A further analysis of WYATT'S results 147 shows that
the activation energy function in these experiments is of the form

$$U(\sigma_a, \sigma) = B\sigma_a/\sigma, \qquad (61)$$

which is almost exactly what would be expected from the theory of
intersecting dislocations; this can be seen by substituting $B \propto \mu b^3$,
$(\alpha\mu b/l) \propto \sigma_a$, and approximating $\sigma + \sigma_a \simeq \sigma$, in equation (34).
Moreover the value of $B$ deduced from the experiments 147 is about 4 eV
which is a very reasonable magnitude for $\alpha\mu b^3$. These experiments of
WYATT therefore lend good support to the idea that work hardening is
caused by intersecting dislocations having to cut through one another.

At higher temperatures and larger creep strains logarithmic creep
usually gives way to a combination of ANDRADE'S $t^{1/3}$ creep and
steady-state creep. WYATT has shown that a mechanical equation of state is
not obeyed during $t^{1/3}$ creep, so that any change made in the exhaustion
theory to enable it to deal with these other forms of creep would have 
to be a profound one. Such a change has been suggested recently, 147 
based on the idea that once an element has jumped, it is not necessarily 
unable to jump again, but faces an energy barrier which is uncorrelated 
with the one it has jumped and is chosen at random from the whole 
range of barriers. This idea is suggested by the model of a dislocation 
meeting a random sequence of obstacles during its glide motion through 
a slip plane. It leads to a creep function which contains a steady-state 
term and has a transient term rather similar to, but not identical with, 
the $t^{1/3}$ function. Except at extremely small strains, where the creep
function approximates to the logarithmic type, a mechanical equation 
of state is not obeyed. 

A formal theory of steady-state creep, the so-called recovery theory,
can be built up on the idea that the creep rate remains constant because
work hardening and thermal softening are balanced against each other
during this kind of flow. 148, 149, 127 Thus OROWAN 149 defines a coefficient
of work hardening, $h = \partial\sigma/\partial\gamma$, and a rate of recovery, $r = -\partial\sigma/\partial t$.
During steady-state creep the yield stress remains constant,

$$d\sigma = \frac{\partial\sigma}{\partial\gamma}\, d\gamma + \frac{\partial\sigma}{\partial t}\, dt = 0,$$

so that the rate of steady flow is given by

$$\kappa = \dot{\gamma} = r/h. \qquad (62)$$

A consequence of the theory is that if the applied stress is suddenly 
reduced by a small amount, during steady flow, there should be a pause 
in the flow before the new steady rate is taken up, since the yield stress 
has first to fall by recovery by the amount that the applied stress was 
reduced. The existence of these induction periods has now been
confirmed. 127, 15 COTTRELL and AYTEKIN, 127 for example, showed with
zinc crystals that the observed time of induction can be accounted for
approximately, with the aid of equation (62), in terms of the reduction
in stress, the rate of steady flow, and the coefficient of work hardening.
They also observed that the dependence of $\kappa$ on applied stress and
temperature could be represented by the relation

$$\kappa = C \exp\{-(Q - \beta\sigma)/kT\}, \qquad (63)$$

where the parameters $Q$ and $\beta$, when measured in a standard range of
$\sigma$, had the same values as in recovery experiments on the same material.
Since equation (62) allows us to write $-(1/h)(d\sigma/dt)$ for $\kappa$, equation (63)
is essentially the differential equation for recovery which, as mentioned
on page 253, integrates to give the logarithmic time law of softening.
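
The recovery picture lends itself to a simple numerical estimate. In the Python sketch below, which is an illustration under stated assumptions and not the procedure used in the experiments cited, the steady rate is taken as κ = r/h from equation (62) and the induction period after a small stress reduction Δσ is estimated as Δσ/r = Δσ/(hκ), on the assumption that the recovery rate stays constant during the pause. The function names and the numerical values are hypothetical.

```python
import math

def steady_rate_from_recovery(r, h):
    """Steady creep rate kappa = r / h, eq. (62)."""
    return r / h

def steady_rate_arrhenius(C, Q, beta, sigma, kT):
    """Steady creep rate from the empirical relation of eq. (63)."""
    return C * math.exp(-(Q - beta * sigma) / kT)

def induction_time(delta_sigma, kappa, h):
    """Estimated pause after a stress drop delta_sigma.

    Assumption: the yield stress falls at the constant recovery rate
    r = h * kappa until it has dropped by delta_sigma.
    """
    r = h * kappa
    return delta_sigma / r

if __name__ == "__main__":
    # Hypothetical values in arbitrary consistent units.
    kappa = steady_rate_from_recovery(r=3.0, h=6.0)
    print(kappa, induction_time(delta_sigma=2.0, kappa=kappa, h=6.0))
```

The estimate uses the steady rate before the stress reduction; whether the old or the new rate is the better choice is not settled by the text, and for small reductions the difference should be slight.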

The explanation of the final stage of the creep curve, in which the 
rate of flow increases progressively to fracture, remains an unsolved 
problem. Various suggestions have been put forward invoking effects 
due, for example, to changes in cross-section, opening of cracks in 
heavily worked slip planes and near grain boundaries, recrystallization, 
and metallurgical phase changes ; accounts of these have been given in 
recent books. 151 However, there is no chance of making a detailed
quantitative, or even qualitative, theory of the late stages of creep until
we have a more intimate picture of the various structural changes which
occur during a creep experiment, which accounts for the great
contemporary interest in metallographic studies of creep. 126, 127, 152

Two features which have claimed most attention are the
cell-structures formed within the grains, and the sliding of grains past one
another along their boundaries. The nature of cell-structures and their
relation to polygonization have already been discussed on page 252.
Slip along grain boundaries has been studied directly by observing the
sliding of one crystal over another in a bicrystal, 153 and indirectly by
means of internal friction measurements. 154, 155 The advantage of this
second method is that the movements involved are very small so that
extraneous effects, due to interlocking of crystals at places where the
boundaries change their direction, are reduced. On the other hand,
because it is an indirect method of study, the interpretation of its results
is less certain. Theories have been developed, for example, in which the
grain boundaries are considered to act either as surfaces along which
slip occurs, 130, 149, 154 rather as if they consisted of monatomic layers of
liquid, or as sources and sinks of vacancies, in which case creep is
caused by the migration of these vacancies through the grains from one
boundary to another. 156, 157, 158

Direct observation of displacements at grain boundaries 159 and of
plastic deformation 160 and cracks 159 in grains at the ends of such
boundaries leaves no doubt that grain boundary slip is a most important
feature of creep in polycrystals at high temperatures. These boundary
movements cannot occur without some sympathetic plastic deformation
at the corners where grains interlock and the final rate of flow must
depend on this deformation within the grains, even though it is
started by slip along the boundaries. The nature of the interaction of
slip on the boundaries with flow in the grains is the problem which
must be solved if the accelerating stage of creep is to be properly
understood, or if the theory of creep is to be of practical value in the design
of creep-resistant alloys. Here, as is usual in these more complicated
problems of crystal plasticity, one looks to metallographic examination
to provide the vital clues to inspire the theory, and there are signs from
the recent work 126, 152, 16 that they will not be long forthcoming.
6
DIFFUSION IN METALS 

A. D. Le Claire 

THE last few years have been witness to a widening interest in the 
phenomenon of diffusion in metals and it is the purpose of the present 
chapter to summarize the more important advances that have been 
made, particularly since the time of writing of Volume I of this series, 1 
which will be referred to hereafter as PMP.1.

Of greatest importance to our general understanding of chemical 
diffusion in polycrystalline materials, the case of most practical
interest, has been the work on the Kirkendall Effect and on grain 
boundary diffusion processes and the first two sections are devoted to 
these topics. A knowledge of the fundamental atomic process by which 
diffusion occurs is essential for a complete understanding of any diffu- 
sion phenomenon and the discussion of this topic forms the subject of 
the next section. This includes an account of the first appreciable 
successes which have been achieved in relating experimentally deter- 
mined diffusion coefficients to theoretical expressions for them. A few 
new and interesting methods for measuring diffusion coefficients are 
described in Section IV and in the last section an account is given of 
recent developments in the phenomenological and kinetic treatments of 
chemical diffusion. 

Perhaps the most important result of the last few years' work has
been to provide a clearer understanding of the nature of the problems 
involved in studying diffusion in metals and so to point the way to the 
type of experiments which are most likely to further progress. 

SECTION I 

THE KIRKENDALL EFFECT 

The original experiments of SMIGELSKAS and KIRKENDALL, 2 revealing
what has now come to be known as the "Kirkendall Effect," were 
performed in 1946 and reviewed in PMP.1. Since then a considerable
amount of experimental work has been devoted to confirming their 
results, to examining them in more detail and to demonstrating the 
occurrence of the Kirkendall effect in an extensive range of systems. 

The importance of the Kirkendall effect lies in the assistance it gives
to elucidating the precise atomic mechanism of diffusion, in its
demonstrating the inadequacy of a single diffusion coefficient for fully
describing the diffusion behaviour of a binary alloy, and, when studied in
more detail, in revealing the complexity of the process which experiments
on chemical diffusion seek to study.

The first of these points is dealt with in Section III. In this section 
we shall review the experimental evidence on which the second two 
points are made and include a reference to the more refined phenomeno- 
logical description of binary diffusion which the occurrence of a 
Kirkendall effect demands. 

As was described in more detail in PMP.1, SMIGELSKAS and KIRKENDALL
laid a parallel set of fine molybdenum wires along each of a pair of
opposite faces of a rectangular bar of 30 per cent α-brass and then
plated it overall with a thick layer of copper. After annealing to produce
diffusion the opposite sets of molybdenum wire markers were found to
have moved in towards one another by an amount proportional to the
square root of the annealing time and greater than could be accounted
for by density changes due to changes in composition. This shift of
markers placed at the original interface between two interdiffusing
metals or alloys, relative to parts of the sample where no diffusion
occurs, is what is known as the "Kirkendall Effect." It has now been
confirmed for various compositions and temperatures in copper/α-brass
couples by CORREA DA SILVA and MEHL, 3 BUCKLE and BLIN, 4 and
BARNES 5 and shown by these investigators and by SEITH and
KOTTMANN 6, 128 to exist also in many other single-phase interdiffusing
systems such as Cu/Ni, 3, 5, 6, 128 Cu/Au, 3 Ag/Au, 3, 6, 128 Ag/Pd, 6, 128
Ni/Co, 6, 128 Ni/Au, 6, 128 and Fe/Ni 128 and in the α-ranges of Sn/Cu, 3 and
Al/Cu. 3, 4

A variation in technique employed in many of these experiments was 
the use of "sandwich samples" prepared by welding to the opposite 
faces of a plate or disc of one of the metals or alloys, on which the 
markers were placed, plates or discs of the other metal or alloy. In all 
cases where measurements were made to show it, 2-5 the extent of the
marker shift was again found proportional to the square root of the
annealing time, strongly suggesting it to be a true diffusion
phenomenon; it also increases with temperature at fixed annealing time, and
is larger the greater the initial difference in composition between the
two halves of the diffusion sample. In all cases the movement is
towards that side of the sample with the lower melting point, 3 that is,
towards copper in copper-nickel samples but towards gold in copper-gold samples.
The observed shifts are of order 1/10 mm for the times and temperatures 
normally employed in diffusion experiments. 
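
The square-root time law can be tested on a set of marker measurements by fitting x = k t^(1/2) and inspecting the residuals. The Python sketch below is a minimal illustration using a least-squares fit through the origin. The data values are invented for the example and the function names are hypothetical; no measurements from the cited papers are reproduced.

```python
import math

def fit_parabolic_constant(times, shifts):
    """Least-squares estimate of k in x = k * sqrt(t), fitted through the origin."""
    num = sum(x * math.sqrt(t) for t, x in zip(times, shifts))
    den = sum(times)                      # sum of (sqrt(t))**2
    return num / den

def max_relative_deviation(times, shifts, k):
    """Largest fractional departure of the data from x = k * sqrt(t)."""
    return max(abs(x - k * math.sqrt(t)) / x for t, x in zip(times, shifts))

if __name__ == "__main__":
    # Invented illustration: anneal times in hours, marker shifts in mm.
    times = [1.0, 4.0, 9.0, 16.0]
    shifts = [0.021, 0.039, 0.061, 0.080]
    k = fit_parabolic_constant(times, shifts)
    print(k, max_relative_deviation(times, shifts, k))
```

A small maximum deviation supports the parabolic law for the markers concerned; a systematic trend in the residuals would indicate that a single constant k does not describe the shift.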

CORREA DA SILVA and MEHL 3 measured the Kirkendall effect in a
large number of sandwich samples both by the direct SMIGELSKAS and
KIRKENDALL method and by new methods. These included, for
example, measuring, relative to the centre of a foil marker, the shift
along the diffusion direction of fine wire markers inserted at the same
interface ("foil reference method") or of the edge of the foil itself,
which was found to keep pace with the wire markers ("foil bend
method"). See Fig. 1. No diffusion occurs across the centre of the foil
which, therefore, always defines the original position of the interface.
Such methods avoid measuring small changes in distance between well
separated sets of markers as in sandwich samples, which might be
affected by secondary processes unconnected with diffusion, but at the
same time might be expected to interfere much more than wire markers
with the unidirectional diffusion flow. They demonstrated that
whatever method of measurement was used and whatever type of marker
was employed, be it alumina powder, carbonized silk thread or any one of
a number of different metal wires or foils, the extent of the marker shift
for any one system was always roughly the same for a given temperature
and time of anneal; this was true, most strikingly, even for platinum
wires which dissolved to produce as "markers" regions within the lattice
itself of high platinum concentration. This confirms that the markers
play only a passive role in the process, serving merely to reveal a
movement relative to one another of different parts of the lattice: to
be explicit we may assume they are fixed relative to the lattice. Using
the copper/α-brass case as an example, an immediate inference from the
Kirkendall effect then is that relative to the lattice the number of
zinc atoms flowing from the brass towards the copper must be greater
than the number of copper atoms flowing in the opposite direction to
take their place. The copper side of the interface must expand, new
lattice planes being created, to accommodate the increased number of
atoms and, if a non-defective lattice in which all sites are occupied is to
be maintained, the brass, which suffers a nett loss of atoms, must at the
same time shrink. Thus is produced a motion towards regions richer in
the more rapidly diffusing zinc of all lattice planes in the diffusion zone,
and in particular of wire markers at the original interface, as observed.

Fig. 1. Kirkendall Effect Experiments

Several processes have been suggested which may lead to the 
unequal opposite flows of the two species of atoms which the Kirkendall 
effect implies. It now seems unlikely that a preferential transport of one 
component through the vapour phase is in general important, as was 
proposed 7 for the zinc in α-brass/copper couples, if only because a
Kirkendall effect is found in such systems as the copper-aluminium
one 3, 4 where the vapour pressures are roughly equal and about 10$^6$
times less than that of zinc. In addition, successive investigations by
different workers 2-5 on 30 per cent α-brass/copper couples have,
through changes in technique, effected a removal or variation of special 
facilities for possible vapour transport such as porosity in electroformed 
deposits and imperfections at the initial interface, without leading to 
any appreciable change in the magnitude of the Kirkendall effect. This 
insensitivity of results to a most probable variation also of grain struc- 
ture provides evidence against a second possibility that the higher 
rate of flow of one component arises principally from its faster diffusion 
at grain boundaries. However, slight bulges in the moving interface in 
the direction of an increased Kirkendall effect have been observed in 
nickel-copper samples 5 near those nickel grain boundaries where suit- 
able etching has revealed a preferential diffusion of copper, but these 
only represent a very small addition to the shift of interface apparent in 
regions well away from grain boundaries. Furthermore, Kirkendall 
effects have been observed 6, 128 in samples the grain size of which was
purposely made large (3-15 mm). It appears then that we must assume 
the different rates of transfer to be associated principally with differ- 
ences in the diffusion rates of the two atoms in the lattice itself. 

It is only at this stage that we are able to reject the possibility of a 
ring diffusion mechanism for those alloy systems in which a Kirkendall 
effect has been found (see Section III). 

The Kirkendall effect experiments, however, do not constitute the 
first or only evidence available to show that relative to the crystal 
lattice the two species of a binary alloy diffuse at unequal rates. As 
early as 1933 SEITH and KEIL 24 showed that the diffusion rates, at low
concentration, of several elements in lead were substantially greater 
than the self diffusion rates of lead in the same alloys. JOHNSON, 13 also, 
in 1942 demonstrated by direct measurement the inequality of the 
self diffusion rates of silver and of gold in a 50 per cent alloy of silver 
and gold. 

BUCKLE and BLIN 4 have suggested that the two species diffuse by
two different mechanisms, the zinc, say, by an interstitial mechanism
and the copper by a vacancy mechanism, so that unequal diffusion
rates are to be expected. In the absence of any other evidence for
interstitial solid solution in either α-brass or in any other system showing
a Kirkendall effect, and in view further of HUNTINGTON and SEITZ'S
calculations (see Section III), this is not a very acceptable suggestion.
SEITZ 59 first clearly pointed out that diffusion of both species by the
vacancy mechanism is perfectly adequate to describe the greater
transfer of, say, zinc than copper, for it is only necessary to assume that
vacancies exchange with zinc atoms more readily than with copper.

The Phenomenological Description of Diffusion with a Kirkendall Effect 
and Further Phenomena Associated with Chemical Diffusion 

Whatever the mechanism of diffusion, the demonstration of unequal
rates of flow for the two atomic species led DARKEN 8 to propose that two
separate diffusion coefficients $D_a$ and $D_b$ should be defined, one for each
species, specifying their respective rates of flow relative to the markers,
or to planes of the lattice. With certain simplifying assumptions he
then showed, as described fully in PMP.1, that on account of the
difference in rates of flow, planes of the lattice, or markers, at points in
the diffusion zone where the concentration gradient is $\partial N_a/\partial x$ will
move, relative to points where there is no concentration gradient and
which are therefore well removed from the diffusion zone, with a velocity
$v$ given by

$$v = (D_a - D_b)\,\partial N_a/\partial x. \qquad (1)$$

Also, the single chemical interdiffusion coefficient as normally
measured experimentally, and referred strictly to the MATANO 9
interface, is given in terms of the separate or "partial chemical diffusion
coefficients" $D_a$ and $D_b$ by

$$D = N_a D_b + N_b D_a, \qquad (2)$$

where $N_a$ and $N_b$ are the fractional concentrations of the two species
at the composition to which $D$, $D_a$ and $D_b$ refer. Since $D_a \ne D_b$, $D$
necessarily varies with composition, although such variation is due also
to variation of $D_a$ and $D_b$ themselves, since $D$ does not appear to vary
linearly with composition.
$D$ measures the rates of flow relative to a surface defined such that
equal numbers of atoms of each species diffuse in opposite directions
across it. This has the merit of simplicity in that such rates of flow are
equal for the two species, and in many practical cases $D$ is an adequate
measure of the diffusion behaviour of an alloy. But a fuller description
is obtained in terms of $D_a$ and $D_b$ and a better understanding of diffusion
in binary alloys would probably be obtained through study of these
theoretically simpler and more fundamental coefficients than through
study of $D$, as has so often been attempted in the past.
In sufficiently dilute solution of, say, A in B, $D_a \simeq D$. But in
general $D_a$ and $D_b$ can only be calculated, using equations (1) and (2),
from simultaneous measurement of $D$ and $v$. It is very often found that
the marker shift $x$ is proportional to $t^{1/2}$, $t$ being the time of anneal. In
such cases $v$ can be obtained from a single measurement of $x$ through
the equation $v = x/2t$. However, LE CLAIRE 30 has pointed out that there
is no obvious reason why in general $x$ should be proportional to $t^{1/2}$,
although such a relation holds when there is no concentration change
at the markers, a condition approximately fulfilled for markers placed,
as they usually have been, at the original interface. SEITH and
KOTTMANN 128 have recently studied the movement of markers placed
in positions in the diffusion zone other than at the original interface
between the two dissimilar metals, and it can be seen from their
results that, for some markers, approximately doubling the time of
anneal more than doubles even the corresponding values of $x$. The
relation $x \propto t^{1/2}$ is then not in fact generally true and $v$ is best
determined from a series of measurements of $x$ and $t$.
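
The inversion of equations (1) and (2) is elementary and may be sketched as follows. Taking equation (1) as v = (D_a - D_b) ∂N_a/∂x and equation (2) as D = N_a D_b + N_b D_a with N_a + N_b = 1, one finds D_a = D + N_a Δ and D_b = D - N_b Δ, where Δ = v/(∂N_a/∂x). The Python sketch below is an illustration only; the function names and the numerical values are hypothetical, and the relation v = x/2t is used only under the stated assumption that the marker shift is parabolic in time.

```python
def marker_velocity_parabolic(x, t):
    """Marker velocity v = x / (2 t); valid only if x is proportional to sqrt(t)."""
    return x / (2.0 * t)

def partial_coefficients(D, v, N_a, dNa_dx):
    """Solve equations (1) and (2) for the partial coefficients D_a, D_b.

    D      : chemical interdiffusion coefficient at the marker composition
    v      : marker velocity
    N_a    : fractional concentration of species a at the markers
    dNa_dx : concentration gradient of species a at the markers (non-zero)
    """
    if dNa_dx == 0.0:
        raise ValueError("equation (1) cannot be inverted where the gradient vanishes")
    delta = v / dNa_dx            # D_a - D_b, from equation (1)
    N_b = 1.0 - N_a
    D_a = D + N_a * delta         # substitute into equation (2)
    D_b = D - N_b * delta
    return D_a, D_b

if __name__ == "__main__":
    # Hypothetical values in consistent units (cm, sec).
    D_a, D_b = partial_coefficients(D=2.4e-9, v=-1.0e-7, N_a=0.3, dNa_dx=-50.0)
    print(D_a, D_b)
```

All four inputs must refer to the same composition and the same instant, which is why simultaneous measurement of D and v is needed.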

But such determinations of D a and D b are based on the validity of the 
assumptions made in deriving the above equations. DARKEN
assumed that changes in lattice parameter on diffusion could be 
ignored, that a non-defective lattice was fully maintained by complete 
shrinkage and that both expansion and shrinkage took place only along 
the direction of diffusion so maintaining constant the cross section 
through which diffusion occurs. The first assumption, never strictly 
valid of course except in self diffusion measurements, may be avoided 
simply by measuring distances in diffusion samples not in terms of 
centimetres but in numbers of lattice spacings. This practice has been 
adopted in recent years by many workers. 3, 10 The validity of the other
assumptions is a matter for experiment. 

A slight porosity in the brass side of their copper/α-brass samples
was reported by SMIGELSKAS and KIRKENDALL 2 and has been confirmed
by later workers. 3-5 A similar porosity, consisting of large numbers of
tiny holes on that side of the interface which the direction of the marker
shift indicates is suffering a nett loss of atoms, has been found also in
many other systems, such as Fe/Ni, 128 Ag/Au, 128 Cu/Ni, 5, 6, 128 Cu/Al, 3, 4
Ag/Pd, 6, 128 and Ni/Au. 6, 128 The amount of porosity varies
considerably from system to system and sometimes may only be revealed by
very careful mechanical polishing. It generally increases with increasing 
amount of diffusion and is greater the larger the initial difference in 
concentration. The density of holes is often not uniform throughout 
the side of the sample losing atoms but shows a well defined maximum in 
one region, the position of which has been observed in at least one case 128 
to move as diffusion proceeds so as to occur always at a particular
concentration in the diffusion zone, its distance from the interface varying
parabolically with the time of anneal. The pores may in some cases
occupy, at the position of maximum density, a considerable proportion 
of the cross section of the diffusion zone in that region, and may join up 
in places to form larger cavities and fissures. SEITH and KOTTMANN 128
for example found this proportion to be as much as 30-40 per cent for
a pure gold/pure silver sample annealed for 98 hours at 900°C.
Occasionally the holes appear 4, 5 to be octahedral in shape with faces parallel to
{111} planes. For this reason, for the fact that they are produced as 
readily in samples made from exclusively vacuum cast material 4 and 
because they are uninfluenced by variations in the atmosphere in which 
samples are annealed, 5 the porosity does not appear to be caused in any 
way by dissolved gases or by secondary gas reactions. From its location
in the diffusion zone, then, the existence of porosity strongly suggests
that shrinkage is incomplete. A sensitive test for complete shrinkage is
to compare the actual marker shift it produces with that to be expected
from the total nett transfer of atoms across the marker interface, as
determined, for example, by subsequent chemical analysis. The two are
only equal when complete shrinkage occurs; otherwise the observed will
be less than the calculated shift. SMIGELSKAS and KIRKENDALL 11 found
that this latter was the case in their experiments and attributed the
difference to their observed porosity. Now in a sandwich sample of brass
between copper the inward marker shift is due to the shrinkage of 
brass, but in a sandwich of copper between brass the outward marker 
movement is due to the expansion of the copper. BUCKLE and BLIN 4
compared the shifts in these two types of samples and showed that the 
expansion of the copper, which was quite regular from sample to 
sample, was greater than the shrinkage in the brass, which showed more 
variation between successive samples, by an amount which corresponded 
quantitatively with the measured total volume of the holes that they 
found in the brass. Similar results were obtained by SEITH and 
KOTTMANN who, by placing additional markers outside the diffusion 
zones at the ends of sandwich samples, were able to compare for each 
sample the two measurements of the Kirkendall shift made relative 
to points, outside the diffusion zone, on each side of an interface. The 
end markers in such samples are also generally found 5, 6, 128 to have
moved apart after diffusion revealing an overall increase in length of 
the samples. This is of course to be expected if a porosity or incomplete 
shrinkage occurs, as also is a decrease in the overall density. Such 
changes of length and density are most apparent when magnified by 
using as diffusion samples a stack of strips or foils of the two metals 
packed alternately. In this way BARNES 5 showed the change in length
to be proportional to the number of diffusion interfaces and to the
square root of the annealing time, and the change in density to correspond
roughly to the total volume of holes.
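The comparison of the two marker shifts described above amounts to simple arithmetic, which may be sketched as follows. The function name and the numerical values are illustrative assumptions, not measurements taken from the work cited. The sketch assumes a constant cross section, so that a shift (a length) is equivalent to a volume per unit area of interface, and it treats the difference between the shift measured on the expanding side and that measured on the shrinking side as the pore volume per unit area.

```python
def shrinkage_deficit(shift_expansion_side, shift_shrinkage_side):
    """Compare two marker shifts given in the same length unit.

    shift_expansion_side: shift measured relative to points on the side
        gaining atoms (regular from sample to sample, per the text).
    shift_shrinkage_side: shift measured relative to points on the side
        losing atoms (reduced when pores form).
    Returns (deficit, fraction): the missing shrinkage, and that deficit
    as a fraction of the shift on the expanding side.
    """
    if shift_expansion_side <= 0:
        raise ValueError("shift on the expanding side must be positive")
    deficit = shift_expansion_side - shift_shrinkage_side
    return deficit, deficit / shift_expansion_side


# Hypothetical shifts in microns, chosen only for illustration.
deficit, fraction = shrinkage_deficit(90.0, 60.0)
print(deficit, fraction)
```

With these assumed figures one third of the expected shrinkage is missing, which on this picture would be taken up as holes.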
The available evidence, both quantitative and qualitative, is then
that this porosity is intimately connected with the diffusion process,
and, in view of its location in the diffusion zone, is indicative of an
incomplete shrinkage of that part of the diffusion zone suffering a nett
loss of atoms. As a typical example, BUCKLE and BLIN 4 found the
marker shift referred to points outside the diffusion zone on the brass
side of copper/α-brass samples to be between 1/2 and 2/3 of the more
consistent values measured relative to points on the copper side. However,
CORREA DA SILVA and MEHL 3 considered that porosity was
negligible and shrinkage effectively complete, for in the many samples
they examined they found the Matano interface, as computed graphically
from plots of concentration as a function of distance expressed in lattice
spacings, to coincide with the original position, relative to the ends of
the sample, of the initial interface between the two halves of the sample.
While this is a perfectly valid test for complete shrinkage, it is, as
BALLUFFI and ALEXANDER point out, a very insensitive one, for the
graphically computed Matano interface varies only slightly in position for
considerable changes in porosity.

[Fig. 2. Lateral Dimension Changes (labels: Welded Interface, Porosity)]
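The graphical location of the Matano interface referred to above can be imitated numerically. The sketch below is a minimal illustration under stated assumptions: the profile is a hypothetical error-function curve, the function name is invented for this example, and the Matano plane is taken in its usual sense as the plane x_M about which the integral of (x - x_M) dc over the whole profile vanishes.

```python
import math


def matano_plane(x, c):
    """Locate the Matano plane from a sampled profile c(x).

    Uses x_M = (integral of x dc) / (c_right - c_left), with the integral
    evaluated by the trapezoidal rule over the sampled points.
    """
    total = 0.0
    for i in range(len(x) - 1):
        total += 0.5 * (x[i] + x[i + 1]) * (c[i + 1] - c[i])
    return total / (c[-1] - c[0])


# Hypothetical profile centred at x = 0.3 (arbitrary length units).
xs = [-5.0 + 0.01 * i for i in range(1001)]
cs = [0.5 * math.erfc((xi - 0.3) / 0.5) for xi in xs]
x_m = matano_plane(xs, cs)
print(round(x_m, 4))
```

For a symmetrical profile the plane falls at the centre of the curve. The test for complete shrinkage then consists in comparing x_m with the original position of the interface measured from the ends of the sample.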

CORREA DA SILVA and MEHL 3 measured the lateral shift of foil
markers placed parallel to the direction of diffusion (Fig. 1) in a
copper/α-brass specimen and concluded that, since it was negligible, all
dimensional changes were in fact unidirectional as assumed by DARKEN. 8
But even the shift to be expected from lattice parameter changes at the 
interface, which would be in the same direction as that due to a lateral 
shrinkage, was not observed, and it may well be that an expansion 
produced by porosity compensated for this expected contraction. How- 
ever, examination of the free surface of a specimen, where there is less 
constraint upon the tendency to expand or shrink, imposed by surround- 
ing material, has provided evidence that such dimensional changes may 
take place normal to the direction of diffusion. Fig. 2 shows the situation
observed in nickel/copper couples by SEITH and KOTTMANN, 6 and
similar effects have been observed 128 by these authors in all the systems
they studied. The Kirkendall shift is towards the copper in this system,
and an apparent expansion and shrinkage, leading to an increase and
decrease in the diameter of the sample on the sides of the interface
suffering respectively a nett gain and a nett loss of atoms, is clearly
evident. Similar effects have been noticed before by JOHNSON 13 and by
FRAENKEL and HOUBEN 14 in gold/silver diffusion samples. JOHNSON
attributed the whole of the effect he found to evaporation, but SEITH and
KOTTMANN prevented free evaporation by packing their samples in
alumina. In addition they observed in silver/palladium samples an
overall decrease in length which they attribute to the nett lateral swelling
of samples and which presumably is greater than the increase in
length the porosity produced. This could not arise from vapour transport.
That there is a tendency operating across the whole of the sample
to shrink on one side of the interface and expand on the other is evident
from the marked bending, observed by BARNES, 5 in nickel/copper
samples that were very thin in the direction of diffusion. The curvature,
uniform across the sample, was such that copper was on the concave
side. Thermal expansion effects would have produced the opposite
curvature in this experiment. SEITH and KOTTMANN 128 estimate that
for one of their silver-gold samples (98 hr at 900°C) only two-thirds of
the expansion on the gold side of the interface occurred along the direction
of diffusion, the rest occurring laterally to produce an increase in
diameter in that part of the diffusion zone.

These results show that the assumptions underlying equations (1) and 
(2) may not always be even approximately satisfied experimentally. The 
occurrence both of porosity and of lateral dimension changes means 
that the cross section through which diffusion is occurring is not 
constant in time or uniform throughout the diffusion zone and may be 
very appreciably reduced in the region of maximum porosity. This 
may seriously impair the accuracy of measurement of D. SEITH and
KOTTMANN 128 determined from their Ag/Au and Fe/Ni samples, using
the Matano method, the variation of D with composition for these
systems and in each case the results showed what is most probably a
false minimum for D near those compositions at which maximum
porosity occurs. 

The same effects will also in general influence the value of v. The 
best value of v for use in equation (1) is obtained by measuring the 
marker shift relative to points outside the diffusion zone, on that side of 
the interface richer in the slower diffusing component: i.e. by using 
samples in which the shift measured is produced by the expansion of 
that part of the diffusion zone receiving a nett addition of atoms. The 
direct effect of porosity in producing a lower value of the shift when 
this is measured relative to points on the other side of the diffusion 
zone is thereby avoided. But there remains the indirect effect of 
porosity, through its influence on the general course of diffusion, and 
the direct effect of the expansion being to some extent lateral to the 
direction of diffusion. 

However, it has usually been experiments in which two dissimilar 
pure metals are diffusing into one another that have revealed marked 
porosity and lateral dimension changes. When the initial difference in 
concentration is smaller, as for diffusion between one metal and an 
alloy of it with another, these phenomena are less marked and their
effect on measurements of D and v may be well within the normal 
experimental errors. But no attention has been given in the past to their 
possible influence and many experimental results may be suspect for 
this reason, especially those from samples covering a large concentra- 
tion range. 

The Vacancy Mechanism of Diffusion and the Kirkendall Effect 

We now return to the vacancy mechanism description of diffusion in a 
binary system, using the copper/α-brass case for convenience in discussion.
If the vacancies exchange with zinc atoms more readily than with
copper atoms then the flow of zinc atoms towards the copper, arising 
from a concentration gradient of zinc atoms, will be greater than the 
flow of copper atoms towards the brass, as the Kirkendall effect indi- 
cates is actually the case. But since each atom which moves in one 
particular direction causes a vacancy to move in the opposite direction, 
there will be a nett flow of vacancies from the copper towards the brass 
equal to the nett flow of atoms, which is towards the copper. (A 
mathematical expression for the nett vacancy flow in terms of different 
probabilities of exchange of a vacancy with the two species of atoms 
has been given by SEITZ 15, 59 and is dealt with in Section V.) The
interdiffusion of brass and copper is then accompanied by a pumping 
of vacancies from copper to brass, and we might expect the vacancy 
concentration to fall below that corresponding to equilibrium in the 
copper and to exceed it in the brass. But the observed nett transfer of 
atoms towards the copper is much greater than could be effected by 
that number of vacancies which are normally in thermodynamic 
equilibrium with the volume of copper usually employed in the experi- 
ments, so it is clear that some source of vacancies must be operating in 
the copper to produce the large number of extra vacancies required. 
Also, the nett number of atoms lost from the brass to the copper is 
replaced by an equal number of vacancies in the brass and since the 
volume of a vacancy is about the same as that of an atom the vacancies 
must disappear in some way if the brass is to shrink and the markers are 
to move towards the brass side of a diffusion sample. 
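The size of the discrepancy may be seen from a rough count. Since a vacancy occupies about one atomic volume, the equilibrium vacancies contained in a slab of copper of thickness L at a site fraction c_v could accommodate a nett transfer of atoms equivalent to a layer only c_v L thick. The sketch below makes this comparison; the vacancy fraction, slab thickness and marker shift are all assumed, illustrative values and are not taken from the experiments discussed.

```python
def equivalent_layer_thickness(vacancy_fraction, slab_thickness):
    """Thickness of the layer of atoms that the equilibrium vacancies in a
    slab could replace, assuming one vacancy occupies one atomic volume."""
    return vacancy_fraction * slab_thickness


# Illustrative assumptions only.
c_v = 1e-4           # assumed equilibrium vacancy site fraction
copper_cm = 1.0      # assumed thickness of the copper, cm
shift_cm = 0.01      # assumed marker shift, cm

available_cm = equivalent_layer_thickness(c_v, copper_cm)
shortfall_factor = shift_cm / available_cm
print(available_cm, shortfall_factor)
```

On these assumed figures the vacancies normally present fall short of the requirement by a factor of about a hundred, so that vacancies must be continually created during the anneal.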

Possible sources for vacancies are at the surface of the specimen, or 
at the grain boundaries or dislocations it contains. If the surface acted 
as the predominant source, as once suggested by SEITZ, 59 then we
should expect the Kirkendall effect to depend on the volume of the
diffusion sample and on whether the outer surface was of copper or of
brass, but such dependence has never been found experimentally. 3-5
In particular the marker shift is always towards the brass and by about 
the same amount in samples with copper sandwiched between brass and 
with brass sandwiched between copper, any difference between the 
two being satisfactorily accounted for as due to porosity. In the
absence of experiments on the Kirkendall effect in single crystals, which
might decide whether or not grain boundaries alone were the predominant
source, the most attractive and simple sources of vacancies are the
dislocations. 16 Fig. 3 shows a simple edge type dislocation. If an atom
at S moves down to T a vacancy is left at S and may then diffuse away by
exchange with other atoms. If this is repeated by all S atoms in planes
parallel to the figure, the whole dislocation will have moved along ZS by
one lattice spacing without change of form and one vacancy will have been
produced for each lattice plane. If a dislocation moves in this way right
across from one side of a crystal to another, a new plane of atoms will
have been inserted into the lattice and the two halves of the crystal will
have moved away from one another by one lattice spacing to accommodate
it. Accommodation is therefore provided
for the nett number of atoms diffusing into the copper by the very
process which produces the vacancies necessary for their transfer.

[Fig. 3. Simple Edge Dislocation]

[Fig. 4. General Dislocation line with Edge and Screw Components (labels: Edge, Screw)]

There is clearly a limit to the extent to which any one simple edge
dislocation may act as a source of vacancies, for the completion of the
extra plane of atoms means the removal of that dislocation from the
crystal. It is doubtful whether the density of dislocations, if all of the
simple edge type, would be sufficient to sustain this loss without it
influencing the course of the diffusion. But a dislocation line is not in
general of the simple straight edge type. It may form a closed ring of one
type or another or may meander across the crystal from one side to the
other, assuming an edge character in some regions and a screw character
elsewhere, much as in Fig. 4. Each section of edge character, such as mn,
may act as a source of vacancies in much the same way as described above,
but extra atoms are not added to complete a single plane but so as to
build up as a spiral surface continuous with the
spiral structure of, and therefore extending, the
screw dislocations joined at the ends of the edge. For each complete
turn of the spiral the edge is merely moved along the slip direction
without change of form. Atoms may then be added indefinitely, that
half of the crystal to the left of mn moving relatively to the other half
in order to accommodate them, and any number of vacancies produced.
The process is completely analogous to the mechanism of crystal growth 
proposed by FRANK 17 (see Chapter 6 of this volume), the edge disloca- 
tion to which atoms are added corresponding to the ledge to which 
they are added on the growing face of a crystal. Other types of 
dislocations which would act as extensive sources of vacancies have 
been described by SEITZ. 18

When we come to consider mechanisms for removal of vacancies we 
notice at once that the above-mentioned processes have only to be 
reversed for dislocations to act as sinks for vacancies. Thus 16 a vacancy 
arriving on any of the sites surrounding the atom S may exchange 
with it and so raise the dislocation line one spacing along SZ. Vacancies 
condensing on the dislocation line in this way ultimately remove a 
whole plane of atoms and cause the two halves of the crystal to move 
towards one another by one atomic spacing. Similarly a general 
dislocation line with edge and screw components may act as an effec- 
tively infinite sink for vacancies merely by reversal of that process by 
which it acts as an infinite source. 

An alternative proposal 3, 16 is that vacancies may aggregate on
particular planes to produce platiform voids which, when they reach a 
critical size, collapse by inducing plastic flow in the surrounding 
material and with a resultant overall shrinkage. Such a collapsed 
platiform void has all the properties and characteristics of a general ring 
dislocation with a component of its Burgers Vector normal to its plane 
("Prismatic Dislocation") which, as SEITZ points out, 60 are sufficient 
to explain all those properties of a crystal for which dislocations are 
usually invoked. In fact he suggests that the dislocation content of a 
crystal may arise solely in this way by condensation, during solidifica- 
tion and cooling, of the very large number of vacancies present in the 
molten metal. In particular such rings may act as further sinks (or 
sources) for vacancies in addition to those provided by other types and 
forms of dislocation. 

The presence of a porosity in diffusion samples suggests that vacancies 
may also aggregate in a more isotropic manner to produce roughly 
spherical cavities. These we might expect to be stable when larger than 
some critical size and to grow by further accretion of vacancies into the 
visibly observable voids as first suggested by LE CLAIRE and BARNES. 19 
Produced in this way, it is not surprising that they should have 
crystallographic shape, for vacancies will exchange preferentially with 
atoms whose removal from the surface of the cavity produces faces of 
minimum surface energy. The incomplete shrinkage of that side of a 
diffusion couple suffering a nett loss of atoms, may then be attributed 
to formation of relatively stable aggregates of some of those vacancies 
which have effected this nett loss. Complete shrinkage only occurs when
the vacancies in excess of those corresponding to thermal equilibrium
are removed completely by condensation on dislocations. 
The nett flux of vacancies at any point is given by

$$D_a \frac{\partial c_a}{\partial x} + D_b \frac{\partial c_b}{\partial x} = (D_a - D_b)\,\frac{\partial c_a}{\partial x}$$

and SEITH and KOTTMANN 128 suggest that the maximum porosity
occurs in a region of the diffusion zone where the concentration is such
that this expression is a maximum.
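The position at which the nett vacancy flux is greatest can be located numerically once a concentration profile and the two coefficients are given. The following is a minimal sketch under stated assumptions: the profile is a hypothetical error-function curve, the coefficients are supplied as functions of concentration with assumed constant values, and the function names are invented for this illustration.

```python
import math


def net_vacancy_flux(x, c_a, d_a, d_b):
    """Evaluate (D_a - D_b) * dc_a/dx at the interior grid points.

    x, c_a: sampled positions and concentrations of species A.
    d_a, d_b: callables giving each coefficient at a given concentration.
    The gradient is taken by central differences, so the returned list
    has two fewer entries than x.
    """
    flux = []
    for i in range(1, len(x) - 1):
        grad = (c_a[i + 1] - c_a[i - 1]) / (x[i + 1] - x[i - 1])
        flux.append((d_a(c_a[i]) - d_b(c_a[i])) * grad)
    return flux


def position_of_max_flux(x, flux):
    """Return the x at which the magnitude of the flux is greatest."""
    i_max = max(range(len(flux)), key=lambda i: abs(flux[i]))
    return x[i_max + 1]  # flux[i] belongs to grid point i + 1


# Hypothetical profile and constant coefficients (cm^2/s), for illustration.
xs = [-1.0 + 0.01 * i for i in range(201)]
cs = [0.5 * math.erfc(xi / 0.2) for xi in xs]
flux = net_vacancy_flux(xs, cs, lambda c: 3.0e-9, lambda c: 1.0e-9)
x_peak = position_of_max_flux(xs, flux)
print(x_peak)
```

With constant coefficients the maximum simply follows the steepest part of the profile. If the coefficients vary with composition, the callables may be replaced accordingly and the maximum will in general lie elsewhere.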

If the expansion in the copper and the contraction in the brass are to 
be only along the direction of diffusion, the planes of atoms removed or 
added during the operation of a dislocation as sink or source of vacancies 
must be normal to this direction. The dislocation must then have its 
Burgers Vector parallel to the diffusion direction. But it is dislocations 
of just this orientation which will be most employed as sources and 
sinks, for if of other orientation those parts of their dislocation edges 
which are advancing along the diffusion direction would soon run out of 
the diffusion zone, and the capacity of that dislocation as a source or 
sink for vacancies in consequence be reduced. BARDEEN and HERRING 20
propose this as an explanation of CORREA DA SILVA and MEHL'S result 
(see p. 272) that there is no appreciable lateral change of dimensions on 
diffusion. While we have seen that this result is not in general true, it is 
true that the expansion and shrinkage take place to a much greater 
extent along the direction of diffusion than would correspond to 
isotropic expansion and shrinkage. A further factor contributing to 
this may be that the surrounding undiffused material will impose 
constraints upon a lateral shrinkage or expansion of the diffusion zone 
i.e. upon the removal or growth of planes of atoms in a direction 
parallel to that of diffusion, but not in directions orthogonal to this. 
When the diffusion zone extends over the whole volume of a sample, so 
that there is no undiffused material, the dimensional changes accom- 
panying diffusion are in fact more nearly isotropic, as recent experiments 
on diffusion into very thin foils and wires have shown. 129 

The growth or removal of planes of atoms in both single crystal and
polycrystalline material will introduce stresses into the lattice, in
particular between neighbouring grains in polycrystalline material.
These may only be relieved by inducing recrystallization and grain 
growth, by introduction of new dislocations and, therefore, of further 
sinks and sources for vacancies, and by processes of plastic deformation 
which themselves contribute to the observed marker shift and lateral 
changes in dimensions. Additional stresses with similar consequence 
will arise from planes of atoms being forced to conform to changes in 
lattice parameter on diffusion. Recrystallization and grain growth in 
the diffusion zone between two metals has often been reported in the 
past 21 and a more recent observation 5 of "polygonization" in the same
region provides experimental evidence for the plastic flow. 

Rough calculations 15, 20 have claimed to show that the normal
dislocation content of a metal is adequate to maintain an equilibrium 
concentration of vacancies throughout the diffusion zone, despite the 
presence of a nett flux of vacancies. For such to be the case the average 
distance travelled by each vacancy during its lifetime must be very 
much less than that over which diffusion occurs, and it is considered 
that the normally assumed density of dislocations should be adequate 
to ensure this, even when by no means every encounter of a vacancy 
with a dislocation leads to its annihilation. However, the occurrence 
of a marked porosity in many diffusing systems would appear to invali- 
date these conclusions and there seems no reason to suppose that a 
vacancy concentration which is everywhere in thermodynamic equili- 
brium with the lattice is in fact maintained in all cases of diffusion. 
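The condition stated above, that a vacancy should travel only a short distance compared with that over which diffusion occurs, may be put in order-of-magnitude form. The sketch below is a crude estimate and not the calculation of the authors cited: it assumes that the distance travelled by a vacancy before meeting a sink is comparable with the mean spacing of dislocation lines, taken as the inverse square root of their density, and that the width of the diffusion zone is of order the square root of D t. All numerical values are assumed. It ignores the possibility, noted in the text, that not every encounter leads to annihilation.

```python
import math


def sink_spacing(dislocation_density):
    """Mean spacing of dislocation lines, taken as density ** -0.5."""
    return 1.0 / math.sqrt(dislocation_density)


def diffusion_distance(d_chem, time):
    """Order-of-magnitude width of the diffusion zone, sqrt(D * t)."""
    return math.sqrt(d_chem * time)


# Illustrative assumptions only.
rho = 1e8        # assumed dislocation density, lines per cm^2
d_chem = 1e-9    # assumed chemical diffusion coefficient, cm^2/s
t = 1e5          # assumed annealing time, s

ratio = sink_spacing(rho) / diffusion_distance(d_chem, t)
print(ratio)
```

A ratio much less than unity corresponds to the case in which equilibrium might be maintained. A lower density of effective sinks raises the ratio, which is the direction in which the observed porosity points.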

The creation and annihilation of vacancies will take place more 
readily at irregularities along an edge dislocation, such as "jogs", 73 so 
that the effective number of points where such processes can occur may 
in fact be an even smaller fraction of the number of atoms along 
dislocation edges than these calculations have allowed for. 

Conclusions relating to the Theoretical Value of Chemical Interdiffusion Measurements

The demonstration of the various phenomena associated with the 
Kirkendall effect makes it difficult to know what theoretical significance 
may be attached to the results of chemical diffusion measurements. 
We have already discussed the difficulties associated with the measurement
of D_a and D_b through use of equations (1) and (2), arising from
porosity and changes in lateral dimensions, but even when these are
sufficiently small not to affect the calculation of D_a and D_b from
measurements of D and v, there would still appear to be several
factors controlling the values obtained which are not normally taken
account of, and which may be very difficult to take account of, in any
theoretical understanding of their magnitudes.
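The calculation of D_a and D_b from measurements of D and v may be sketched as follows. Equations (1) and (2) are not reproduced in this part of the text, so the sketch assumes that they take the form usually attributed to DARKEN, namely v = (D_a - D_b) dN_a/dx and D = N_b D_a + N_a D_b, with N_a and N_b the atomic fractions. The sign convention for v, the function name and the numerical values are assumptions made for illustration.

```python
def intrinsic_coefficients(d_chem, v, n_a, grad_n_a):
    """Solve the assumed Darken relations for D_a and D_b.

    Assumed relations (sign convention for v is an assumption):
        v      = (D_a - D_b) * dN_a/dx
        d_chem = N_b * D_a + N_a * D_b,  with N_b = 1 - N_a
    """
    if grad_n_a == 0:
        raise ValueError("concentration gradient must be non-zero")
    n_b = 1.0 - n_a
    difference = v / grad_n_a          # equals D_a - D_b
    d_a = d_chem + n_a * difference
    d_b = d_chem - n_b * difference
    return d_a, d_b


# Hypothetical measured values at the marker plane.
d_a, d_b = intrinsic_coefficients(d_chem=2.5e-9, v=-4.0e-9, n_a=0.25, grad_n_a=-2.0)
print(d_a, d_b)
```

The result is only as good as the measured D and v, so that the errors in these quantities arising from porosity and lateral dimension changes pass directly into D_a and D_b.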

Theoretical treatments usually assume that the rates of chemical 
diffusion in an alloy depend only upon the vacancy concentration and 
upon the different probabilities of exchange of the two atoms with a 
vacancy in the presence of a concentration gradient, these probabilities 
being determined only by the local composition. It is assumed in 
theoretical treatments that the vacancies are in equilibrium with the 
lattice but in the case of chemical diffusion there exists the possibility 
of a vacancy concentration which is not only removed from its equili- 
brium value but also continuously changing during the diffusion. Also, 
it has been suggested 22 that vacancies may aggregate to form double
vacancies in regions of high vacancy concentration and that these
would be more mobile than single vacancies and so lead to a high 
diffusion rate for the atoms. Abnormally large diffusion rates may then 
occur in that side of a diffusion couple which suffers a nett loss of atoms. 
We may mention that two independent diffusion coefficients D_a and D_b
may only be legitimately defined by Fick's Law, and so equations (1) 
and (2) are only strictly valid, when the vacancies are everywhere in 
equilibrium with the lattice. Otherwise the rate of flow of, say, A 
atoms depends not only on the concentration gradient of A atoms, but 
also on that of the vacancies. This is discussed more fully in Section V. 
Furthermore, there is the possibility of stresses influencing the rate of 
diffusion and of inhomogeneous plastic flow concurrent with the diffusion,
which latter may generate additional vacancies and produce
distorted lattice regions in which abnormal diffusion rates occur. The 
last two factors are associated with the shrinkage and expansion which 
accompany chemical diffusion and, with the addition of stresses arising 
from the interdiffusion of atoms of unequal size, will tend to produce 
recrystallization and so an enhancement of the grain boundary contribu- 
tion to the overall flow. 

The diffusion rates then appear dependent upon a number of quantities,
many of them changing during diffusion and varying with the
nature of the sample, and all difficult to determine separately. The
relative importance of these several factors in determining the measured 
values of chemical diffusion coefficients and their variation with 
composition and the details of their operation in chemical diffusion, 
remain matters for further experimental study, but their presence may 
well be found to be not unconnected with the often very considerable 
disagreement between the diffusion coefficient measurements made on 
the same system by different observers. Since the Kirkendall effect, 
and therefore phenomena associated with it, is larger the greater the 
difference in concentration between the two halves of a diffusion sample, 
we might expect the most reliable and repeatable chemical diffusion 
coefficient measurements to be those conducted with the smallest 
practicable concentration gradients. A study of the dependence of D
upon concentration gradient at fixed concentration might reveal to 
what extent these complicating features, associated with a Kirkendall 
effect, do, in fact, influence the measured value of D. But the rates of 
self diffusion of the two species of atoms in a chemically homogeneous 
alloy depend only upon the vacancy concentration and upon the pro- 
babilities of exchange of the atoms with vacancies. Since chemical 
homogeneity is maintained throughout the diffusion process the vacancy 
concentration remains always at its equilibrium value and none of the 
other complicating factors which may influence chemical diffusion 
rates and their measurement arise at all. Self-diffusion coefficients 
of the atoms of an alloy have then a much simpler and direct significance
and furthermore give directly the most fundamental quantity underly- 
ing all diffusion phenomena, namely the average number of random 
jumps made per second by each atom in a homogeneous alloy in 
thermodynamic equilibrium. Measurements of self-diffusion coefficients 
would probably be far more profitable in furthering a detailed under- 
standing of diffusion in alloys than the already very considerable 
number of chemical diffusion coefficient measurements have been. 
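The connection between a self-diffusion coefficient and the number of jumps made per second may be illustrated by the elementary random-walk relation D = (1/6) Γ λ², where Γ is the jump rate and λ the jump distance. This relation is an assumption brought in for the illustration: it holds for uncorrelated jumps of equal length in a cubic lattice and neglects any correlation between successive jumps. The function name and numerical values are likewise illustrative.

```python
def jump_rate(d_self, jump_length):
    """Jumps per second from D = (1/6) * rate * jump_length ** 2.

    d_self in cm^2/s and jump_length in cm; correlation effects are
    neglected (an assumption of this sketch).
    """
    return 6.0 * d_self / jump_length ** 2


# Hypothetical values: D = 1e-10 cm^2/s, jump distance 2.5e-8 cm.
rate = jump_rate(1e-10, 2.5e-8)
print(rate)
```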

SECTION II 

GRAIN BOUNDARY AND SURFACE DIFFUSION 

It has always been appreciated that in the neighbourhood of a grain 
boundary and on the surface of a metal the rate at which diffusion takes 
place may differ appreciably from that within a single crystal or grain 
because of the different structure which these regions present. We may 
define coefficients D_l, D_gb, and D_s specifying respectively the rates of
flow by diffusion in the lattice, in grain boundaries, and on the surface
of a metal, and a full description of the diffusion behaviour of any system
requires a knowledge of all these coefficients and their variation with
concentration. Such a knowledge is clearly important for a proper
understanding of the practically important diffusion rates in
polycrystalline materials, for in general these will be determined by D_gb and
D_s as well as D_l. Many of the discrepancies between the values of
diffusion coefficients obtained by different workers on the same system
for both self and chemical diffusion may well be connected with the very
common failure to take proper account of the possible contribution to
the observed diffusion flow by grain boundary processes, which may
have been more significant in some cases than others. D_s may vary with
the orientation of the surface and D_gb with both the mutual orientation
of the grains adjoining the boundary and with the direction of the grain 
boundary for fixed orientation. Studies of grain boundary and surface 
diffusion may then also provide information on the nature of those 
regions in which they occur. 

Whereas lattice diffusion can by suitable means be studied apart 
from grain boundary and surface diffusion, these processes themselves 
must necessarily be accompanied by diffusion in the lattice of the crystals 
or grains with which they are so intimately associated, and so cannot 
be studied apart from lattice diffusion. Experimental methods are, 
therefore, directed towards a comparison of the rates of flow by grain 
boundary or surface diffusion with that by lattice diffusion. 
Grain Boundary Diffusion

Two principal methods of investigation have been employed for grain 
boundary diffusion studies. 

I. The comparison of the overall rates of diffusion in materials of 
various grain sizes but similar in all other respects. It has generally 
been assumed in the past 21, but as we shall see erroneously, that only
when the measured bulk diffusion coefficient varies with grain size is
D_gb appreciably different from D_l.

II. By examination of individual grain boundaries in sections cut 
from diffusion samples parallel to the direction of diffusion the relative 
amounts of diffusion at the grain boundaries and in the neighbouring 
grains may be directly observed. This method usually requires the use 
of a suitable etch which will reveal differences in composition. It is 
clearly an essential condition for full advantage to be taken of this more 
informative method that there should be no changes in the grain 
boundary pattern during the diffusion. If the grain boundaries do 
move as a result of recrystallization or grain growth any difference 
between grain boundary and lattice diffusion rates may be completely 
masked, and even if not completely so the results will anyway be of 
indefinite quantitative significance. 

In the comparatively few studies of grain boundary diffusion that
have been made, it has been reported that D_gb and D_l do not always
differ but that when they do, they, and the corresponding activation
energies Q_gb and Q_l where these have been estimated, are such that D_l is
less than D_gb and Q_l greater than Q_gb. This is not surprising in view of
the more open structure presented for diffusion at grain boundaries.
This early work and its results have been fully reviewed on other
occasions. 21, 25, 26, 50
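One consequence of Q_l being greater than Q_gb is that the ratio D_gb/D_l grows as the temperature is lowered. This may be illustrated with the usual Arrhenius form D = D_0 exp(-Q/RT), which is assumed here. The pre-exponential factors and activation energies below are hypothetical round numbers, not values for any system discussed in the text.

```python
import math

R = 8.314  # gas constant, J/(mol K)


def arrhenius(d0, q, temperature):
    """Diffusion coefficient from the assumed form D = D0 * exp(-Q / RT)."""
    return d0 * math.exp(-q / (R * temperature))


def boundary_to_lattice_ratio(temperature, d0_gb, q_gb, d0_l, q_l):
    """Ratio D_gb / D_l at the given absolute temperature."""
    return arrhenius(d0_gb, q_gb, temperature) / arrhenius(d0_l, q_l, temperature)


# Hypothetical parameters: equal D0, Q_gb = 100 kJ/mol, Q_l = 200 kJ/mol.
ratio_hot = boundary_to_lattice_ratio(1200.0, 1.0, 100e3, 1.0, 200e3)
ratio_cold = boundary_to_lattice_ratio(800.0, 1.0, 100e3, 1.0, 200e3)
print(ratio_hot, ratio_cold)
```

On these assumed figures preferential grain boundary diffusion is far more pronounced at the lower temperature.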

More recently a much needed mathematical analysis of simultaneous
grain boundary and lattice diffusion has been presented 27 and has
provided a method, much less objectionable than hitherto, for the
quantitative measurement of the ratio D_gb/D_l. Quantitative estimates
of D_gb have been made in the past but these have usually been based on
a far less satisfactory analysis, often employing, for example, the
Dushman-Langmuir equation. 28 Application of the analysis has been
made to a few systems and it has also shown that method I will not
necessarily always reveal a difference between D_l and D_gb even if a very
appreciable difference does in fact exist. This method must then be
applied with caution and many reports for various systems studied by
it, that D_gb and D_l are appreciably the same, may, therefore, be false.
A second important new advance has been a study for the first time of
the dependence of grain boundary diffusion rates upon the orientations
with respect to the boundary of the two adjoining grains.
Methods I and II have both been used to reveal a preferential grain
boundary diffusion in a few more systems in which it had not previously 
been observed, although much of this has been in connexion with the 
work mentioned in the last paragraph. 

Simultaneous Grain Boundary and Lattice Diffusion 

FISHER 27 has considered diffusion into a semi-infinite bi-crystal with its
grain boundary normal to the surface, as shown in section in Fig. 5.
Several assumptions are made to simplify the analysis. (1) The
concentration C_0 of diffusing material at the surface y = 0 is maintained
constant throughout the anneal. (2) The grain boundary is
regarded as a uniform slab of material of width δ in which the diffusion
coefficient D_gb is much greater than that for diffusion in the adjoining
crystals, D_l: the grain boundary region is not, of course, so well
defined a region as this. Material diffusing more rapidly along the grain
boundary than elsewhere will be lost from it laterally into the adjacent
grains so producing an effective penetration of material which is greater
in the neighbourhood of the grain boundary than the uniform penetration
by lattice diffusion alone in regions well removed from the grain
boundary. It is assumed (3) that this lateral loss occurs by diffusion
only in a direction normal to the grain boundary, (4) that D_l and D_gb
are independent of concentration, and (5) that the concentration in the
grain boundary is uniform across its width δ; this is taken into account
in drawing, in Fig. 5, a line of constant concentration or a "concentration
contour."

[Fig. 5. Diffusion in the Neighbourhood of a Grain Boundary (labels: concentration contour; grain boundary, x = 0)]

With these assumptions a numerical integration of the appropriate 
differential equations, for diffusion in the grain boundary and for 
lateral diffusion in the adjoining grains, gave a solution the behaviour 
of which had certain features which suggested a simple analytical 
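expression for C gb , the concentration of laterally diffused material in
the vicinity of the grain boundary:

$$C_{gb} = C_0 \exp\left(-\frac{\sqrt{2}\, y_1}{\pi^{1/4}\, t_1^{1/4}}\right) \operatorname{erfc}\left(\frac{x_1}{2\, t_1^{1/2}}\right) \qquad (3)$$

where x 1 , y 1 and t 1 are dimensionless quantities given by

$$x_1 = x/\delta, \qquad y_1 = \frac{y}{\delta\,(D_{gb}/D_l)^{1/2}}, \qquad t_1 = D_l t/\delta^2$$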

It is to be emphasised that while this represents a useful and practi- 
cable relation for studying grain boundary diffusion, and so far as it has 
yet been tested 29 leads to results which are confirmed by experiment, it 
is obtained only by making very considerable simplifications in the 
mathematics and a more detailed and rigorous study of the problem 
would be very welcome. Failure to bear in mind the many assumptions 
upon which equation (3) is based may lead to serious errors in the 
interpretation of experimental results. 
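
The behaviour of an expression of this kind is easily explored
numerically. The following is a minimal sketch only, assuming the form
commonly quoted for FISHER'S approximate solution, C gb = C 0 exp
(-√2 y 1 /(π t 1 )^¼) erfc (x 1 /2 t 1 ^½), with x 1 = x/δ,
y 1 = y/δ(D gb /D l )^½ and t 1 = D l t/δ². The function name and the
numerical values in the usage lines are illustrative assumptions and are
not taken from the original analysis.

```python
import math


def fisher_concentration(x, y, t, D_l, D_gb, delta, C0=1.0):
    """Approximate concentration at lateral distance x from the boundary
    and depth y below the surface, after anneal time t (cgs units).

    Assumes the commonly quoted Fisher-type form; all five simplifying
    assumptions listed in the text are implied.
    """
    x1 = x / delta
    y1 = y / (delta * math.sqrt(D_gb / D_l))
    t1 = D_l * t / delta ** 2
    # Exponential fall-off with depth along the boundary.
    along_boundary = math.exp(-math.sqrt(2.0) * y1 / (math.pi * t1) ** 0.25)
    # Error-function profile for lateral leakage into the grains.
    lateral = math.erfc(x1 / (2.0 * math.sqrt(t1)))
    return C0 * along_boundary * lateral


# Illustrative (assumed) values: delta = 5e-8 cm, D_l = 1e-13 cm^2/sec,
# D_gb = 1e-8 cm^2/sec, t = 1e5 sec.
c_near = fisher_concentration(0.0, 1e-3, 1e5, 1e-13, 1e-8, 5e-8)
c_far = fisher_concentration(2e-4, 1e-3, 1e5, 1e-13, 1e-8, 5e-8)
print(c_near, c_far)
```

The concentration falls exponentially with depth along the boundary and
as an error-function complement with lateral distance from it, which is
the shape of the contour sketched in Fig. 5.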

Let us now consider diffusion into a polycrystalline sample. For
simplicity we assume this to have a columnar structure so that all the
grain boundaries are parallel to the direction of diffusion and intersect
the free surface normally, as does the one in Fig. 5. In any thin section
of width Δy, cut normal to the direction of diffusion at a distance y
from the free surface y = 0, a certain amount A l of the material which
has diffused into the section will have been transferred only by lattice
diffusion through the grains, and the remainder A gb by diffusion down
the grain boundaries and by subsequent lateral diffusion from them.
If there are n grains with average linear dimensions R normal to the
columnar or diffusion direction, then A l is given by

$$A_l = n R^2\, \Delta y\, C_0\, \operatorname{erfc}\left(\frac{y}{2 (D_l t)^{1/2}}\right) \qquad (4)$$

where C 0 is the supposedly constant surface concentration of diffusing
material and t is the annealing time. If we assume that the total
amount of material actually in the grain boundaries is negligible
compared with that which has been lost by lateral diffusion, we may
write for A gb

$$A_{gb} = \ldots \int C_{gb}\, dx \qquad (5)$$

where C gb is given by equation (3). TURNBULL 26 has used equations
(4) and (5) to study the relative importance of transfer by lattice and
by grain boundary diffusion in diffusion experiments on polycrystalline
material, and has presented some results, applicable to a typical set of
experimental conditions, in the form of Table 1. This shows, for various
values of D gb /D l , the grain size for which A l and A gb are equal when
y = 0.02 cm, t = 10⁵ sec and D l = 10⁻⁹ cm²/sec, conditions such that
the concentration at y of material penetrating by lattice diffusion
is ~ 15 per cent of C 0 .

TABLE 1

D gb /D l                                10⁶     10⁵       10⁴            10³
Grain size for which A gb = A l (cm)     0.26    0.0014    2.15 × 10⁻⁶    <10⁻…

If we may define lattice diffusion as predominant when A l > 10 A gb ,
then since A l /A gb ~ R, experiments in which a measured diffusion
coefficient showed no change when the grain size was varied from 2.6 to
0.014 cm would not be evidence for assuming that D gb and D l were
appreciably the same, but only that D gb /D l < 10⁵ .
Thus we may only expect method I to reveal a difference between D gb
and D l when a very considerable range of grain sizes is employed.
Alternatively, since Q gb is usually less than Q l , the ratio D gb /D l
increases with decreasing temperature, so that method I may produce
positive results at sufficiently low temperatures where D gb is so much
greater than D l that A gb is comparable with A l for the grain sizes used.

When D gb is appreciably greater than D l , material will diffuse much
further into the sample in the neighbourhood of a grain boundary than
elsewhere, and if sections are cut at a greater distance from the surface
y = 0 than the maximum depth of penetration by lattice diffusion
alone through the grains, their content of diffusing material will be
given by equation (5). Performing the integration and taking logs, we
find 27 for A gb /Δy, the content per unit thickness,

$$\log (A_{gb}/\Delta y) = -\frac{\sqrt{2}\; y}{(\delta D_{gb}/D_l)^{1/2}\, (\pi D_l t)^{1/4}} + \text{constant} \qquad (6)$$

where the constant varies with t but does not contain y. 
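
Two pieces of arithmetic in the foregoing may be checked with a short
calculation. The sketch below is illustrative only. It first evaluates
the error-function term of equation (4): with t = 10⁵ sec and
D l = 10⁻⁹ cm²/sec a depth of 0.02 cm corresponds to erfc (1), or about
15 per cent of the surface concentration. It then inverts the slope of a
plot of the natural logarithm of A gb /Δy against y to give the product
δ D gb , assuming the slope has the form
-√2/[(δ D gb /D l )^½ (π D l t)^¼]. The function names and the trial
values are assumptions made for the example.

```python
import math


def lattice_fraction(y, D_l, t):
    """C/C0 at depth y for lattice diffusion alone (erfc term of eq. 4)."""
    return math.erfc(y / (2.0 * math.sqrt(D_l * t)))


def log_profile_slope(delta_Dgb, D_l, t):
    """Assumed slope d ln(A_gb/dy)/dy for a Fisher-type profile."""
    return -math.sqrt(2.0) / (
        math.sqrt(delta_Dgb / D_l) * (math.pi * D_l * t) ** 0.25
    )


def delta_Dgb_from_slope(slope, D_l, t):
    """Invert the assumed slope expression for the product delta * D_gb."""
    return 2.0 * D_l / (slope ** 2 * math.sqrt(math.pi * D_l * t))


# Lattice penetration under the conditions quoted for Table 1.
print(lattice_fraction(0.02, 1e-9, 1e5))

# Round trip with assumed trial values (cgs units).
trial_product = 5e-8 * 1e-8          # delta * D_gb, hypothetical
s = log_profile_slope(trial_product, 1e-13, 1e5)
print(s, delta_Dgb_from_slope(s, 1e-13, 1e5))
```

Only the product δ D gb is recovered from the slope; a value of δ must be
assumed separately if D gb itself is wanted.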

This relation has been used by HOFFMAN and TURNBULL 29 to determine
for the first time a grain boundary self-diffusion coefficient, that of
silver. A thin layer of radioactive silver was electroplated on to one
face of a number of polycrystalline samples, which were then annealed
at temperatures in the comparatively low range 500-700°C. It was
assumed that surface diffusion and diffusion in the highly disorganized
thin electroplated layer would be sufficiently rapid to maintain the
fairly constant surface concentration necessary for the validity of
equation (3). The log of the specific activity of thin layers cut
parallel to the surface after anneal was found to vary linearly with
their distance y from the surface. Transfer by purely lattice diffusion
would have required a proportionality to y²; that this was not found may
be regarded as evidence in favour of FISHER'S equations (3) and (6).
Knowing D l (see equation (8)) and assuming a value of 5 × 10⁻⁸ cm for δ,
they found from equation (6) that for high purity silver

$$D_{gb} = 0.03 \exp(-20{,}200/RT)\ \mathrm{cm^2/sec} \qquad (7)$$

At about 500°C, therefore, D gb /D l ~ 10⁵ .
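
The order of magnitude of this ratio may be verified directly from the
two Arrhenius expressions. The following sketch takes D gb = 0.03 exp
(-20,200/RT) from equation (7) and the lattice coefficient D l = 0.895
exp (-45,950/RT) from equation (8), with R = 1.987 cal/mole deg; the
helper name is an assumption for illustration.

```python
import math

R_CAL = 1.987  # gas constant, cal/(mole K)


def arrhenius(D0, Q, T_kelvin):
    """D = D0 exp(-Q/RT), with Q in cal/mole and D0 in cm^2/sec."""
    return D0 * math.exp(-Q / (R_CAL * T_kelvin))


T = 500.0 + 273.15                  # 500 deg C in kelvin
D_gb = arrhenius(0.03, 20200.0, T)  # equation (7)
D_l = arrhenius(0.895, 45950.0, T)  # equation (8)
ratio = D_gb / D_l
print(f"D_gb = {D_gb:.2e}, D_l = {D_l:.2e}, ratio = {ratio:.1e}")
```

The ratio obtained is a few times 10⁵, consistent in order of magnitude
with the figure quoted, and it increases rapidly as the temperature is
lowered because Q gb is much smaller than Q l .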

To define D gb properly requires the validity of assumption (2) above
and to calculate it from equation (3) or (6) requires a knowledge of δ,
for the equations only give the product (D gb δ). But it may be more
useful 30 to characterize grain boundary diffusion behaviour in terms of
this measured quantity rather than D gb alone, for it is more significant
in representing, without reference to δ, the amount of material diffusing
in unit time along the grain boundary and across unit length of it
normal to the diffusion direction, the concentration gradient also
being unity.

HOFFMAN and TURNBULL 29 have also measured, by standard methods,
the self-diffusion coefficients of silver in both single crystal and
polycrystalline material, with the results shown in Fig. 6.

Fig. 6. Lattice and Grain Boundary Diffusion in Silver

At high temperatures single crystal and polycrystalline samples show
the same self-diffusion coefficients and these results, which fit closely
the earlier high temperature measurements of JOHNSON 31 on coarsely
polycrystalline silver, may, therefore, be taken as representing the true
lattice diffusion coefficient. The equation to the full line is

$$D_l = 0.895 \exp(-45{,}950/RT)\ \mathrm{cm^2/sec} \qquad (8)$$

so that Q gb ≈ ½ Q l . But at lower temperatures the apparent diffusion
coefficient for polycrystalline material is appreciably greater than that
for single crystals, and the results of experiments between 500 and
650°C fit more closely the equation

$$D_{app} = 2.3 \times 10^{-5} \exp(-26{,}400/RT)\ \mathrm{cm^2/sec} \qquad (9)$$

It is clear that at high temperatures and with the grain sizes employed
(~ 10⁻² cm) D gb is not sufficiently greater than D l for the contribution
to the overall diffusion flow by grain boundary diffusion to be
significant, but that at lower temperatures, where D gb /D l is greater,
grain boundary diffusion is contributing appreciably.

These experiments are quoted in full because they illustrate a very
important point in connexion with all experimental determinations of
diffusion coefficients in polycrystalline materials which purport to give
a measure of the lattice diffusion coefficient. It is usually considered
that the more extensive the temperature range over which such
measurements are made, the more accurate will be the resulting values of
D 0 and Q (in the equation D = D 0 exp (-Q/RT)) derived from them.
We now see that they may represent lattice diffusion only at high
temperatures but that abnormally high values of D may be obtained
at the lower temperatures due to appreciable grain boundary diffusion.
Experiments are not usually sufficiently accurate or numerous to reveal
a departure at low temperatures from a linear relation between log D
and 1/T, so a "best straight line" drawn uncritically through both high
and low temperature results, when plotted as log D against 1/T, may
then produce values of D 0 and Q which are of no particular theoretical
significance and both less than the true values for lattice diffusion,
which it is often assumed they represent. In fact the results for high
temperatures in Fig. 6, with those at low temperatures for
polycrystalline material, together lie quite closely to a straight line
with equation

$$D = 8 \times 10^{-3} \exp(-38{,}000/RT)$$

D 0 is very appreciably less than, and Q lies between, the true values
for lattice and for grain boundary diffusion.
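
The effect of drawing a single "best straight line" can be reproduced
with synthetic data. The sketch below is illustrative only: it generates
points from the lattice expression D = 0.895 exp (-45,950/RT) at an
assumed set of high temperatures and from the apparent polycrystalline
expression D = 2.3 × 10⁻⁵ exp (-26,400/RT) between 500 and 650°C, and
fits one line to log D against 1/T by least squares. The temperatures
chosen and the function names are assumptions; they are not the
experimental points of HOFFMAN and TURNBULL.

```python
import math

R_CAL = 1.987  # gas constant, cal/(mole K)


def arrhenius(D0, Q, T_kelvin):
    """D = D0 exp(-Q/RT), Q in cal/mole."""
    return D0 * math.exp(-Q / (R_CAL * T_kelvin))


def fit_arrhenius(temps_K, Ds):
    """Least-squares line through ln D against 1/T; returns (D0, Q)."""
    xs = [1.0 / T for T in temps_K]
    ys = [math.log(D) for D in Ds]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return math.exp(intercept), -slope * R_CAL


# Assumed temperatures (deg C), converted to kelvin.
low_T = [c + 273.15 for c in (500, 550, 600, 650)]
high_T = [c + 273.15 for c in (750, 800, 850, 900, 950)]

temps = low_T + high_T
values = [arrhenius(2.3e-5, 26400.0, T) for T in low_T] + [
    arrhenius(0.895, 45950.0, T) for T in high_T
]

D0_fit, Q_fit = fit_arrhenius(temps, values)
print(f"fitted D0 = {D0_fit:.2e} cm^2/sec, fitted Q = {Q_fit:.0f} cal/mole")
```

The fitted activation energy falls between the values for the two
branches and the fitted pre-exponential term falls well below that for
lattice diffusion, which is the behaviour described above.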

HOFFMAN and TURNBULL took a contact autoradiograph of a taper
section of a diffusion sample, cut in the diffusion zone and at a small
angle to the plated surface. Blackening was strongest along a series of
lines which corresponded exactly with the traces of the grain boundaries
revealed by etching the same sample, and the lines were thicker on that
side of the section nearer the plated surface. This indicates,
independently of FISHER'S analysis, that D gb is, in fact, greater than
D l for self-diffusion in silver. The same experiment has been performed
with tin by FENSHAM, 32 with similar results.

Method II (p. 281) has been used in the past only to reveal
qualitatively any difference between D gb and D l . By suitable choice of
etch, or by use of contact autoradiography in grain boundary
self-diffusion studies, it may yield results from which concentration
contours as in Fig. 5 could be obtained. Their shape at distances along
the grain boundary greater than the maximum depth of penetration by
lattice diffusion alone is governed by equation (3) and can, therefore,
be used to test this equation and to derive values of (D gb δ).

LE CLAIRE 30 pointed out that simple differentiation of equation (3)
shows the angle φ at which a concentration contour meets the grain
boundary to be simply related to D gb δ, independently of concentration,
by the equation

$$\cot^2 \phi = \frac{\delta D_{gb}}{2 D_l\, (\pi D_l t)^{1/2}} \qquad (10)$$



This is derived directly from equation (3) without further assumptions,
so that observations on this one simple geometrical feature of
concentration contours can be used to check FISHER'S equation. If it is
valid then, when D gb and D l are assumed independent of concentration,
φ is the same whatever the concentration at the contour, and for
observations on the same grain boundary after successive anneals, tan⁴ φ
should be proportional to t.

It also provides a quick and convenient method for determining D gb
and its variation with temperature and orientation. BARNES 33 has used
method II to show that copper diffuses more rapidly in the grain
boundaries than in the grains of nickel, and the etchant used was such
as to reveal several concentration contours.

Application of equation (10) showed that at 1000°C for this system

D gb /D l ~ 8 × 10⁵

The depth of penetration by grain boundary diffusion in this system
is also greater than that by lattice diffusion by an amount which
decreases markedly with increase in temperature, suggesting that Q gb
is again less than Q l .
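
The use of the contour angle may be illustrated numerically. The sketch
below assumes the relation obtained by differentiating a FISHER-type
solution at the boundary, cot² φ = δ D gb /[2 D l (π D l t)^½], where φ is
the angle between the contour and the grain boundary; the precise form of
the relation, the function names and the trial values are assumptions
made for illustration only.

```python
import math


def boundary_ratio_from_angle(phi_deg, D_l, t):
    """Return delta * D_gb / D_l (in cm) from a measured contour angle."""
    phi = math.radians(phi_deg)
    return 2.0 * math.sqrt(math.pi * D_l * t) / math.tan(phi) ** 2


def contour_angle(delta_Dgb_over_Dl, D_l, t):
    """Return the contour angle in degrees for a given delta * D_gb / D_l."""
    cot_squared = delta_Dgb_over_Dl / (2.0 * math.sqrt(math.pi * D_l * t))
    return math.degrees(math.atan(1.0 / math.sqrt(cot_squared)))


# Hypothetical values: delta = 5e-8 cm, D_gb/D_l = 8e5, D_l = 1e-11 cm^2/sec.
product = 5e-8 * 8e5
phi_1 = contour_angle(product, 1e-11, 1e5)
phi_2 = contour_angle(product, 1e-11, 4e5)   # same boundary, four times longer

tan4_ratio = (math.tan(math.radians(phi_2)) / math.tan(math.radians(phi_1))) ** 4
print(phi_1, phi_2, tan4_ratio)
```

On this relation the fourth power of tan φ scales directly with the
annealing time, so that successive anneals of one specimen provide a
simple test of the analysis.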

Orientation Dependence of Grain Boundary Diffusion 

BECK, SPERRY and Hu 34 have suggested that their observations, that 
grain boundary migration during recrystallization is more rapid the 
larger the disorientation between the grains flanking the boundary, 
may be explained by assuming that the rate of diffusion of atoms within 
the grain boundary varies with orientation. But the first direct study 
of the variation of grain boundary diffusion rates with relative grain 
orientation has been made recently by ACHTER and SMOLUCHOWSKI, 35
for silver diffusing in copper. BERMAN and HARRINGTON 36 had previ- 
ously shown qualitatively for this system that silver diffused into copper 
more rapidly in the grain boundaries than in the adjoining grains. 

For convenience in interpretation of the results, ACHTER and
SMOLUCHOWSKI used specimens of "columnar" copper, in which all
grains had approximately a common cubic direction and their grain
boundaries parallel to this direction. Two co-ordinates completely
specify the configuration of each grain boundary in such a sample,
rather than the five required in the more general case. The relative
orientation of the two grains flanking the boundary is given by the
angle θ between a pair of cube axes normal to the columnar direction,
and the direction of the grain boundary by the angle β between one of
the cube axes and the plane of the boundary. Diffusion couples of Cu
and Cu + 4 per cent Ag were welded together with the columnar
direction in the copper normal to the interface, so that all grain
boundaries were parallel to the direction of diffusion of the silver.
After annealing, samples were quenched to retain the silver in solid
solution. The maximum depth of penetration of silver down the grain
boundaries was determined by etching consecutive sections lapped
parallel to the interface and noting at what depth from it, for each
grain boundary, silver, as revealed by the etchant, could no longer be
detected. The amount of silver in each grain boundary decreased with
increasing distance from the interface, and the maximum depth of
penetration was found to vary with the relative orientation θ, as
determined by etch pit methods, in the interesting manner shown in
Fig. 7. The appearance of a typical etched section is shown in Fig. 8.
At angles < 20° and > 70° the silver does not penetrate along the grain
boundaries any further than it does into the grains by lattice diffusion
alone, but for θ > about 20° the grain boundary penetration depth exceeds
the lattice penetration by an amount which increases as θ increases from
20°, reaches a maximum at 45° and then falls to zero again at 70°. The
results also suggested, but only vaguely, that the preferential grain
boundary penetration may be least where the grain boundary is
symmetrical with respect to the lattices of the adjoining grains, so that
β = θ/2.

Fig. 7. Extent of Diffusion X vs. Orientation Difference θ, 141 hr at 725°C
[Figure label: Extent of Volume Diffusion]

Fig. 8. Columnar Diffusion Sample cut parallel to Interface

It would appear then that grain boundary diffusion rates are not
simply or directly related to the grain boundary energy, since READ
and SHOCKLEY 37 have shown the energy to increase continuously as θ
increases from zero up to about 20°.

The results were based upon observations made on pairs of grains
which deviated from the perfect columnar structure by less than 7-8°,
but there were many pairs 41 with an angle γ, between those (100)
directions most nearly parallel to the general columnar or diffusion
direction, which was appreciably greater than 8°. At a junction of three
grains in a section perpendicular to the diffusion direction it often
appeared that silver at a silver rich boundary had diffused
perpendicularly to the columnar direction a little way into a silver free
boundary, as at (a) in Fig. 8. Such situations enabled a comparison to be
made of diffusion rates in two mutually perpendicular directions in one
grain boundary. It appeared 41 that even when θ was less than 20°, so
that no preferential grain boundary diffusion occurred along the columnar
direction, an appreciable diffusion perpendicular to the columnar
direction in the same boundary could occur provided γ was greater
than 20°.

These results reveal not only a dependence of D gb upon the relative 
orientation of the grains but also, for fixed orientation, upon the 
direction within the grain boundary, i.e. an anisotropy of grain boundary 
diffusion. 

In an earlier series of experiments using a slightly different method,
ACHTER and SMOLUCHOWSKI 38 estimated an activation energy for
grain boundary diffusion of Ag in Cu of about 23,800 cal/mole, compared
with the lattice diffusion activation energy, estimated at the same time,
of about 38,300 cal/mole.

Quite recently all this work has been criticized by DAWSON, 39 who
suggests that it may only be a study of grain boundary precipitation
as a function of solute supersaturation and orientation difference. He
supposes that the silver diffuses into the copper everywhere by normal
lattice diffusion and that on cooling the grain boundaries merely act
as preferred sites for precipitate nucleation. However, ACHTER and
SMOLUCHOWSKI 40 point out that the manner in which the silver is
distributed in and around the grain boundaries is inconsistent with this
viewpoint, as also are the observed anisotropy and low activation energy.
But even more recently FLANAGAN and SMOLUCHOWSKI 42 have extended
this work to the diffusion of zinc in copper, a system free from
precipitation effects but in which a variation of grain boundary
penetration with orientation was observed very similar in nature to that
of the silver-copper system. DAWSON'S criticisms seem then unfounded.
A grain boundary activation energy of 24,500 cal/mole has been
estimated 42 for Zn diffusing in Cu, compared with a lattice diffusion
activation energy of 34,000 cal/mole.

Grain Boundary Models and Diffusion 

Many of the above results have been discussed by ACHTER, FLANAGAN
and SMOLUCHOWSKI 35, 42, 13 in terms of some of the various models of a
grain boundary that have been proposed by other workers (see PMP.1,
Chapter 3). If we suppose the boundary to consist of a suitable array of
edge dislocations then an inference from the anisotropy observations is
that diffusion is most rapid in a grain boundary along the lines of the
edge dislocations which compose it, provided there is a difference in
orientation between adjacent grains of more than 20°, but no faster than
ordinary lattice diffusion in a direction perpendicular to the
dislocation lines. For when γ is small and θ > 20° the dislocation lines
run most nearly along the columnar direction, in which the grain boundary
diffusion is rapid, but when γ > 20° and θ is small, as in some pairs of
grains deviating considerably from the columnar structure, dislocation
lines run most nearly perpendicular to the general columnar direction and
it is only in this direction now that grain boundary diffusion is rapid.

That a disorientation as large as 20° is necessary before any
preferential grain boundary diffusion becomes apparent strongly suggests
that the mobility of atoms along individual edge dislocations is not
appreciably different from their normal lattice mobility, for provided
the angle is less than about 20° the dislocations composing the grain
boundary are still individually distinguishable and separate from one
another. But the angle of 20°, at which the grain boundary rate begins
to be appreciably greater than that of lattice diffusion, is just in the
range where the distance between dislocations becomes so small that
their individuality is lost; the grain boundary is now better represented
by such a model as that of MOTT, 43 who pictures it as consisting of
islands of good lattice fit surrounded by regions of misfit, which we may
regard perhaps as formed by the grouping together of now individually
indistinguishable dislocations. This change in structure of the grain
boundary as θ, increasing, passes through a value of about 20° has been
demonstrated by LOMER and NYE 44 with the BRAGG bubble model, and
appears to be made manifest by the grain boundary diffusion behaviour.
While the isolated regions of misfit described as single dislocations
provide no special facilities for enhanced grain boundary diffusion,
because perhaps the void space associated with them is too small to play
the role of vacancy and permit passage of a diffusing atom (we should not
then expect such sub-structural features of single crystals as mosaic
boundaries to influence diffusion rates), the more extensive regions of
misfit, which they combine to produce at larger angles, do allow more
rapid diffusion because perhaps of their increased void space. Twin
boundaries are regions of good lattice fit even if the relative
orientation of the twin grains is large, and accordingly it is found that
the diffusion rate in them is less than corresponds to the relative
orientation, as shown by the cusp in Fig. 7 at angles corresponding to
twinning on {210} planes. BARNES 33 found no evidence for any increased
diffusion of copper at the presumably {111} twin boundaries of nickel.
The islands of good fit will be elongated, and the areas of misfit,
therefore, less interrupted, along the same direction as that of the
dislocation lines at lower angles of disorientation, so that grain
boundary diffusion will be preferred along this direction and the
anisotropy as observed at angles greater than 20° is still explicable.

It will be clear that FISHER'S assumption (2), and therefore his
equation (3), is most nearly valid for large angle boundaries where the
regions of misfit will be extensive flat areas covering a large
proportion, if not all, of the grain boundary. Such boundaries are
believed 26 to have been predominant in the polycrystalline silver used
by HOFFMAN and TURNBULL in the experiments described above. At smaller
angles the regions of misfit will be less extensive, and for θ only a
little greater than ~ 20° may be regarded 132 as rods or channels of
distorted structure, separated by relatively undistorted areas. D gb and
Q gb are now to be taken as referring to diffusion in the distorted
misfit regions only of a grain boundary. Now the maximum depth to which
material penetrates by diffusion down a grain boundary is determined both
by D gb and by the rate, which depends on D l , at which material is lost
from the boundary into the adjoining grains. For low angle grain
boundaries this loss of material, being in all radial directions from
each of the rod shaped regions of misfit, will, relative to the flux of
material down the misfit regions themselves, be greater than for high
angle grain boundaries, where the loss is principally by planar diffusion
from the extensive flat areas of misfit. The maximum depth of
penetration, for this reason alone, will therefore increase with
increasing values of θ, above a certain value, as shown in Fig. 7.

Since Q gb < Q l , the rate of loss of material from the boundary
increases more rapidly with temperature than the rate of flow down the
boundary itself. Thus the apparent activation energy, calculated from
measurements of the variation with temperature of the maximum depth
of penetration, will be less than the true activation energy Q gb and will
vary with the structure of the boundary, i.e. with the angle θ, being
lowest and possibly even negative for the lower angle boundaries
(θ ~ 20°), where the loss into the grains is relatively more important.
For θ < ~ 20°, of course, where there is no preferential grain boundary
diffusion, the apparent activation energy equals Q l .

FLANAGAN and SMOLUCHOWSKI 42 have recently studied the temperature
dependence of the grain boundary diffusion of zinc in copper and
have found an orientation dependence for the apparent activation
energy of just the type expected from the above described model.
Taking into account the changing structure of the grain boundary with
θ, SMOLUCHOWSKI 130 has presented a theoretical discussion of grain
boundary diffusion, with particular reference to the apparent activation
energy, and obtained values for this which vary with θ in a manner
closely agreeing with that observed in the zinc in copper experiments.

Little consideration has yet been given to the precise mechanism 
of diffusion in a grain boundary. Although the tendency is to discuss 
this in terms of the vacancy mechanism, the distinction between the 
various mechanisms which have been proposed for lattice diffusion is 
much less clear in the disorganized regions of a grain boundary; it 
may be quite impossible for example to distinguish in the boundary 
between an interstitial position and a vacant lattice site. 35 The 
observed anisotropy and the nature of the variation of D with orienta- 
tion indicate that the increase in diffusion rates at grain boundaries 
arises mainly from the effect their changed structure has on the mobility 
of the atoms composing them rather than that it is principally due to the 
grain boundaries providing a higher concentration in their neighbour- 
hood than elsewhere of, say, vacancies, and so increasing diffusion rates 
over a region on either side of the grain boundary more extensive than 
that over which the lattice is irregular. In the latter case δ would be
appreciably greater than the rough value of 5 × 10⁻⁸ cm usually assumed
for it.

It was at one time suggested 45 that the same unit atomic process was
responsible for grain boundary slip, for constant rate creep in single
crystals and for volume diffusion because their activation energies
appeared to be the same, at least for α-Fe, α-brass and Al. The unit
process considered was an exchange or re-arrangement of atoms in what
KÊ described 61 as "disordered groups" of atoms. These exist, he
supposed, in certain concentration within the grains and in much higher
concentration where they formed grain boundaries. Since they would
also give rise to grain boundary diffusion, the inference is drawn that
Q gb should also be equal to the activation energy for the three
above-mentioned processes. Of particular interest is the predicted
equality of Q gb and Q l . But apart from the fact that experiment shows
Q gb to be always quite appreciably less than Q l , a recent and more
accurate redetermination of the activation energy for self-diffusion in
iron 46 and a study of self-diffusion 47 and of grain boundary slip 48 in
tin show that the activation energies for KÊ'S three processes are
certainly not equal for tin and iron, the activation energies for
self-diffusion being appreciably less than those for grain boundary slip
and for creep. These processes are not then all intimately related to one
another through a common unit process, or to grain boundary diffusion,
and there is no reason for expecting Q gb to equal Q l . Should they be
equal then differences between D gb and D l would reside wholly in the
constant term in the Arrhenius equation; a grain boundary model which
would then explain the observed differences has been discussed by
LE CLAIRE. 30

It has often been suggested that impurities in metals may have a
tendency to segregate in grain boundaries and the first convincing
demonstration of a case where this is so has recently been given, 125 the
case of Po impurities in PbBi alloys. The polonium impurity was
produced by decay of radioactive Bi incorporated in the alloy and,
although present at as low a concentration as 1 part in 10¹⁰, was shown
by autoradiography to be present in the alloy after annealing in much
higher concentration at the grain boundaries than elsewhere. Impurities
which do segregate in this way will most probably have an effect on
grain boundary processes out of all proportion to their total
concentration. Thus KÊ found 60 that very small amounts of impurity
completely block grain boundary slip in aluminium, although HOFFMAN and
TURNBULL 29 found little change with purity in the observed grain
boundary self-diffusion rates in silver. BIRCHENALL and MEHL, 49
however, found that while at high temperatures the self-diffusion
coefficients in polycrystalline samples of iron of various purities were
substantially the same, at low temperatures they were greater in the iron
of commercial purity than in more pure material. They suggest that
perhaps in a pure metal D gb and D l are normally the same but that
impurities accumulating in grain boundaries have the effect, at least in
commercial iron, of increasing D gb and so also the apparent low
temperature self-diffusion coefficient. But these results are not
conclusive because they had no quantitative evidence on the relative
grain sizes of the various purity materials; the apparently smaller grain
size in the less pure iron, if in fact sufficiently smaller than that of
the pure iron, would equally well explain the results.

It would be surprising if impurities alone were responsible for the 
increase in grain boundary diffusion over lattice diffusion rates but their 
importance in affecting values of D gb is a matter which merits further 
and more extensive investigation. 

Surface Diffusion 

A number of interesting experiments have demonstrated the very rapid
mobility of surface metal atoms and summarizing reviews 26, 92 of these
are already available in the literature. In a very few cases quantitative
estimates of D s and of Q s have been made which show Q s to be less than
both Q gb and Q l , and D s to be greater than D gb and D l , which from
the nature of the process is not unexpected. In the last few years some
further contributions have been made in this field which generally
confirm these results.

It will be clear that FISHER'S analysis 27 of grain boundary diffusion
is, with similar assumptions, directly applicable to surface diffusion if
in his equations D gb is replaced by D s and δ by ½δ s , δ s being the
effective width of the diffusing surface layer. The factor ½ arises
because material is, of course, only lost by lateral diffusion into the
grain from one side of the surface layer. But no use yet appears to have
been made of this analysis in surface diffusion experiments.

NICKERSON and PARKER SI have made measurements on the surface 
self diffusion of silver and obtained the result 

D s = 0-16 exp ( 10,300/#T) em 2 /sec .... (11) 

Radioactive silver was deposited at one small region of a bundle of
silver wires and the surface diffusion coefficient calculated from the
distribution along the wire of the active silver after annealing at various
temperatures between 225 and 350°C. To measure the distribution the
wire was scanned with a counter tube fitted with a lead collimator and 
thin slit so that at each position the activity of only a small section of 
the wire was measured. However, the results were analysed without 
regard to loss of material from the surface into the grains : the wires 
used were, in the absence of any indication to the contrary, presumably 
polycrystalline so that surface atoms would also be lost into grain 
boundaries. For these reasons and in view of the considerable scatter 
among the results equation (11) represents only a rough estimate of D s . 
However, taken with HOFFMAN and TURNBULL'S results we now have 
a more complete account of self-diffusion processes in Ag than in any 
other metal. 
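
To give a feel for the magnitudes implied by equation (11), the following minimal sketch evaluates it at the two ends of the annealing range quoted above. The function name and the choice of R = 1.987 cal/(mole K) are assumptions made for illustration; the constants 0.16 and 10,300 are those of equation (11), and the result should be read as no more than the rough estimate the text describes.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K); assumed unit convention


def surface_diffusivity_ag(temp_c, d0=0.16, q=10_300.0):
    """Evaluate equation (11), D_s = d0 * exp(-q / RT), in cm^2/sec.

    temp_c is the temperature in degrees Celsius; d0 (cm^2/sec) and
    q (cal/mole) default to the values quoted in equation (11).
    """
    temp_k = temp_c + 273.15
    return d0 * math.exp(-q / (R_CAL * temp_k))


# Ends of the experimental annealing range quoted in the text.
for temp_c in (225, 350):
    print(f"{temp_c} C: D_s = {surface_diffusivity_ag(temp_c):.2e} cm^2/sec")
```

Because the activation energy is small, $D_s$ changes by less than an order of magnitude over the whole experimental range.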

A similar method has been used by FRAUNFELDER 52 to study the
surface diffusion of copper (Cu$^{64}$) on silver with the single result
that at 750°C, $D_s = 7 \times 10^{-2}$ cm$^2$/sec.

MÜLLER 53 has used an ingenious method to study the surface
self-diffusion of tungsten. The potential difference required to maintain
a given electron emission from a fine tungsten point is a function of its
radius of curvature. Changes in shape occur when the point is maintained
at a high temperature (1600 to 2500°C) and, assuming such changes are
due to surface diffusion, MÜLLER has calculated from the change in
potential with time that the activation energy for surface self-diffusion
of tungsten is 106,500 cal/mole. This is appreciably less than the value
of 143,000 cal/mole for lattice self-diffusion estimated by
VAN LIEMPT. 54 It is interesting to note that the ratio $Q_s/Q_l$ is much
higher for tungsten than for silver but it should be remembered that
VAN LIEMPT'S estimate is not from direct radioactive measurements.
He merely assumes that his values of the coefficient for diffusion of iron
in tungsten, being measured at very low iron concentration, represent
also the self-diffusion coefficient of tungsten. His value for $Q_l$ may
then be an underestimate of the true value.

SECTION III 

THE MECHANISM OF DIFFUSION 

1. Diffusion in Interstitial Solid Solutions 

It has never been doubted that the diffusion of solute atoms in inter- 
stitial solid solutions occurs by successive jumps of the solute atoms 
from one interstitial position to another, for it is difficult to imagine 
any alternative mechanism in this case. 

Recently there have become available new methods for measuring
interstitial diffusion coefficients (see Section IV) which extend the
temperature range over which such measurements may be made and so
provide more accurate values than have before been available of the
theoretically important quantities $D_0$ and $Q$ in the usual equation
employed to express the temperature dependence of $D$, viz.

$$D = D_0 \exp(-Q/RT) \qquad (12)$$

Prompted by this important development, renewed attention has
recently been given, notably by WERT and ZENER, 56, 57, 58 to a
theoretical interpretation of $D_0$.
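
The way $D_0$ and $Q$ are obtained from measurements through equation (12) can be sketched as a straight-line fit of $\ln D$ against $1/T$: the slope gives $-Q/R$ and the intercept gives $\ln D_0$. The helper `fit_arrhenius` and the data below are hypothetical; the data are synthetic values generated from assumed parameters purely to exercise the fit, not measurements from any of the papers cited.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K)


def fit_arrhenius(temps_k, diffusivities):
    """Least-squares line through ln D versus 1/T.

    Returns (D0, Q) with D0 in the units of the input diffusivities
    and Q in cal/mole, following D = D0 * exp(-Q / RT).
    """
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(d) for d in diffusivities]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # equals -Q / R
    intercept = y_mean - slope * x_mean    # equals ln D0
    return math.exp(intercept), -slope * R_CAL


# Synthetic, noise-free data from assumed parameters (illustration only).
TRUE_D0, TRUE_Q = 0.02, 20_000.0
temps = [500.0, 550.0, 600.0, 650.0]
data = [TRUE_D0 * math.exp(-TRUE_Q / (R_CAL * t)) for t in temps]

d0_fit, q_fit = fit_arrhenius(temps, data)
print(f"D0 = {d0_fit:.4g} cm^2/sec, Q = {q_fit:.6g} cal/mole")
```

Since $D_0$ comes from extrapolating the line to $1/T = 0$, a wider temperature range fixes the slope more tightly and so improves $D_0$ markedly, which is why the extended range of the new methods matters.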

Treating the process of diffusion as a random walk problem, it may
readily be shown that for a cubic lattice the diffusion coefficient $D$ for
any one species of atom is given in terms of $\tau$, the average time
between successive jumps of each atom of that species, by the expression

$$D = \alpha a^2/\tau \qquad (13)$$

where $a$ is the lattice constant and $\alpha$ is a geometric factor
depending on the disposition round any one equilibrium site of
neighbouring sites into which the atom may diffuse by jumping. $\alpha$ is
easily computed and turns out, for interstitial diffusion, to be 1/24 for
a b.c.c. and 1/12 for a f.c.c. lattice. The remaining problem is to
determine $1/\tau$.
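
The geometric factor can be checked with a few lines of exact arithmetic. For an uncorrelated random walk in three dimensions $D = \langle r^2\rangle/6\tau$, so that $\alpha = \langle r^2\rangle/6a^2$, where $\langle r^2\rangle$ is the mean squared jump length. The sketch below assumes, as is not stated explicitly in the text, that the interstitial atoms occupy octahedral sites; the jump vectors are written in units of $a$ and all names are illustrative.

```python
from fractions import Fraction
from itertools import permutations, product


def alpha(jump_vectors):
    """Geometric factor alpha = <|jump|^2> / (6 a^2), jumps in units of a."""
    mean_sq = sum(x * x + y * y + z * z for x, y, z in jump_vectors) / len(jump_vectors)
    return mean_sq / 6


half = Fraction(1, 2)

# b.c.c. octahedral interstitial site with its axis along z (assumed
# site type): four neighbouring interstitial sites at distance a/2.
bcc_interstitial = [(half, 0, 0), (-half, 0, 0), (0, half, 0), (0, -half, 0)]

# f.c.c. octahedral interstitial site (assumed): twelve neighbours of the
# type (a/2)<110>. The same vectors join nearest-neighbour lattice sites.
fcc_jumps = list({
    perm
    for s1, s2 in product((half, -half), repeat=2)
    for perm in permutations((s1, s2, 0))
})

# b.c.c. nearest-neighbour lattice sites: eight jumps of the type (a/2)<111>.
bcc_lattice_jumps = list(product((half, -half), repeat=3))

print("b.c.c. interstitial:", alpha(bcc_interstitial))
print("f.c.c. (a/2)<110> jumps:", alpha(fcc_jumps))
print("b.c.c. lattice jumps:", alpha(bcc_lattice_jumps))
```

The last case is included only for comparison with the value of $\alpha$ quoted for b.c.c. lattices in the discussion of self-diffusion later in this section.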

WERT 57 and WERT and ZENER 56 have applied for this purpose the
essential assumptions and methods of EYRING'S 62 theory of reaction
rates but have adapted it to this particular problem more carefully
than previous workers. The most general expression for the rate of
an activated process is, according to EYRING,

$$1/\tau = p\,(kT/h)\,F^*/F$$

where $k$, $T$ and $h$ have their usual meaning and $p$ is the number of
equivalent paths for diffusion of an atom from any one position of
equilibrium (for interstitial diffusion $p$ is 4 in b.c.c. and 12 in f.c.c.
lattices). $F$ and $F^*$ are the partition functions for the system in its
initial and activated states respectively, i.e. for an atom in its normal
equilibrium position and at the top of the potential barrier separating
it from the next position to which it is diffusing. Now in evaluating
$F^*$, the degree of freedom at the saddle point corresponding to movement
over the barrier is omitted. $F^*/F$ is usually written as
$\exp(-\Delta G'/RT)$, where $\Delta G'$ is referred to as the "free
energy" change in passing from the equilibrium to the activated state. But
this quantity is not strictly a free energy for it is the ratio of two
partition functions with unequal numbers of degrees of freedom. In order
to obtain a similar ratio of partition functions with equal numbers of
degrees of freedom WERT and ZENER 56 write

$$F = F_\nu \cdot F_1$$

where $F_\nu$ is the partition function for a single linear oscillator of
frequency $\nu$, this being considered to represent the vibrational motion
of the atom about its equilibrium position and along the direction of
impending diffusion. $F_1$ is then the partition function for the system
with the atom constrained to move in a plane normal to the direction
of diffusion and passing through the equilibrium position. For the
temperatures at which diffusion measurements are made it is quite
sufficient to approximate the usual expression for $F_\nu$ by
$F_\nu = kT/h\nu$ so that we arrive at the result

$$1/\tau = p\nu\,F^*/F_1$$

Since $F^*$ and $F_1$ now refer to two states with equal numbers of
degrees of freedom, we may, in writing

$$1/\tau = p\nu \exp(-\Delta G/RT) \qquad (14)$$

legitimately refer to $\Delta G$ as a free energy change and so equate it
to the isothermal work done in moving an atom from an equilibrium position
to the top of the potential barrier separating it from the next equilibrium
position, the atom only being constrained during this motion to move
in directions normal to the line joining the two equilibrium positions.
$\Delta G$ in (14) has therefore a clear physical significance.
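
How good the approximation $F_\nu = kT/h\nu$ is can be judged numerically. The sketch below compares it with the quantum partition function of a harmonic oscillator, $1/[2\sinh(h\nu/2kT)]$, with energies measured from the bottom of the well. The frequency of $10^{13}$ sec$^{-1}$ and the temperatures are assumed, order-of-magnitude values chosen only for illustration.

```python
import math

H_PLANCK = 6.62607015e-34   # Planck constant, J s
K_BOLTZ = 1.380649e-23      # Boltzmann constant, J/K


def oscillator_partition_exact(nu, temp_k):
    """Quantum partition function of one linear harmonic oscillator,
    energies measured from the bottom of the potential well."""
    x = H_PLANCK * nu / (K_BOLTZ * temp_k)
    return 1.0 / (2.0 * math.sinh(x / 2.0))


def oscillator_partition_classical(nu, temp_k):
    """High-temperature limit F_nu = kT / (h nu) used in the text."""
    return K_BOLTZ * temp_k / (H_PLANCK * nu)


NU_ASSUMED = 1.0e13  # sec^-1, assumed typical lattice frequency

for temp_k in (300.0, 600.0, 1000.0):
    exact = oscillator_partition_exact(NU_ASSUMED, temp_k)
    classical = oscillator_partition_classical(NU_ASSUMED, temp_k)
    print(f"T = {temp_k:6.0f} K: exact/classical = {exact / classical:.3f}")
```

The ratio approaches unity as the temperature rises, the error being of relative order $(h\nu/kT)^2/24$, which is small compared with the experimental uncertainty in $D_0$.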

A derivation of equation (14) from first principles has been given 
by WERT. 57 

Assuming the isothermal work to be done at constant pressure we
may make use of the thermodynamic relation

$$\Delta G = \Delta H - T\,\Delta S$$

in writing, from equations (13) and (14), the expression for $D$,

$$D = p\alpha a^2\nu \exp(\Delta S/R)\exp(-\Delta H/RT) \qquad (15)$$

$\Delta S$ and $\Delta H$ are referred to respectively as the "entropy" and
"enthalpy" (or heat) of activation.

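We also have the additional relation

$$\frac{\partial(\Delta G/T)}{\partial(1/T)} = \Delta H$$

so it follows that

$$\frac{\partial \ln D}{\partial(1/T)} = -\frac{\Delta H}{R}$$

and hence that $\Delta H$ in equation (15) corresponds with $Q$ in
equation (12). $D_0$ is, therefore, given by the relation

$$D_0 = p\alpha a^2\nu \exp(\Delta S/R) \qquad (16)$$
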

WERT and ZENER estimate $\nu$ by assuming the potential energy of a
solute atom to vary sinusoidally as it moves from one interstitial lattice
position to another with an amplitude taken as equal to the heat of
activation $\Delta H$. With such an assumption they obtain

$$\nu = (\Delta H/2m\lambda^2)^{1/2}$$

where $\lambda$ is the distance between interstitial positions and $m$ is
the mass of the atom.
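
As a rough numerical illustration of this estimate of $\nu$, the sketch below evaluates it for carbon in $\alpha$-iron. The inputs are assumptions introduced for the example and are not taken from the text: a heat of activation of 20,100 cal/mole, a jump distance of half the lattice constant of $\alpha$-iron (taken as 2.866 Å), and the atomic mass of carbon. $\Delta H$ must be expressed per atom before it is inserted in the formula.

```python
import math

AVOGADRO = 6.02214076e23      # atoms per mole
CAL_TO_JOULE = 4.184          # thermochemical calorie
AMU_KG = 1.66053907e-27       # atomic mass unit in kg


def attempt_frequency(delta_h_cal_per_mole, mass_amu, jump_distance_m):
    """nu = sqrt(dH / (2 m lambda^2)) for a sinusoidal potential whose
    total height is the heat of activation dH (taken per atom)."""
    dh_per_atom = delta_h_cal_per_mole * CAL_TO_JOULE / AVOGADRO   # joules
    mass_kg = mass_amu * AMU_KG
    return math.sqrt(dh_per_atom / (2.0 * mass_kg * jump_distance_m ** 2))


# Assumed illustrative inputs for carbon in alpha-iron.
DELTA_H = 20_100.0            # cal/mole (assumption)
JUMP_DISTANCE = 2.866e-10 / 2 # metres, a/2 with a assumed 2.866 angstrom
CARBON_MASS = 12.011          # atomic mass units

nu_c_in_fe = attempt_frequency(DELTA_H, CARBON_MASS, JUMP_DISTANCE)
print(f"nu = {nu_c_in_fe:.2e} per sec")
```

The result is of the order of $10^{13}$ sec$^{-1}$, the usual magnitude of lattice vibration frequencies, and depends only on the square root of $\Delta H$, so that moderate errors in $\Delta H$ affect $\nu$ very little.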

The only unknown quantity now appearing in the expression for $D_0$
is $\Delta S$. This may be evaluated by noting the further thermodynamic
relation

$$\Delta S = -\left(\frac{\partial \Delta G}{\partial T}\right)_P$$

If we assume that all the work $\Delta G$ appears as strain energy in the
lattice surrounding the diffusing atom when at the saddle point, then
we may put

$$\Delta G = \tfrac{1}{2}\,\mu\,\varepsilon^2 v \qquad (17)$$

where $\mu$ is some shear modulus of the lattice, $\varepsilon$ a measure
of the shear strain, and $v$ the volume over which the strain is apparent.
The variation with temperature of that part of the experimentally observed
modulus which is associated only with lattice elasticity, and which does
not contain a contribution from grain boundary relaxation processes, is
found to be quite closely linear. 63 Therefore, we may write

$$\mu = \mu_0 + T\,d\mu/dT \qquad (18)$$

and if we ignore any possibly small temperature variations of $v$ and
$\varepsilon$, we find

$$\Delta H = \tfrac{1}{2}\,\mu_0\,\varepsilon^2 v$$

and

$$\Delta S = -\Delta H\,\frac{\partial(\mu/\mu_0)}{\partial T} \qquad (19)$$

$\partial(\mu/\mu_0)/\partial T$ is always found to be negative so that
$\Delta S$ is essentially positive and $\exp(\Delta S/R)$ always greater
than one.
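
The chain from the temperature coefficient of the modulus to $D_0$ can be followed numerically. The sketch below applies equation (19) and then equation (16) for an interstitial solute in a b.c.c. lattice. Every numerical input is an assumption chosen for illustration only (in particular the temperature coefficient of $-3 \times 10^{-4}$ per degree is hypothetical and not a measured value), and the function names are not from the original papers.

```python
import math

R_CAL = 1.987  # gas constant, cal/(mole K)


def activation_entropy(delta_h, dmu_rel_dt):
    """Equation (19): dS = -dH * d(mu/mu0)/dT, in cal/(mole K)."""
    return -delta_h * dmu_rel_dt


def d0_interstitial(p, alpha, a_cm, nu, delta_s):
    """Equation (16): D0 = p * alpha * a^2 * nu * exp(dS/R), in cm^2/sec."""
    return p * alpha * a_cm ** 2 * nu * math.exp(delta_s / R_CAL)


# Assumed illustrative inputs (b.c.c. lattice, interstitial solute).
DELTA_H = 20_100.0        # cal/mole, assumed heat of activation
DMU_REL_DT = -3.0e-4      # per K, hypothetical d(mu/mu0)/dT
P_PATHS = 4               # equivalent paths, b.c.c. interstitial (from text)
ALPHA = 1.0 / 24.0        # geometric factor, b.c.c. interstitial (from text)
A_CM = 2.866e-8           # lattice constant in cm, assumed
NU = 1.3e13               # per sec, assumed vibration frequency

delta_s = activation_entropy(DELTA_H, DMU_REL_DT)
d0 = d0_interstitial(P_PATHS, ALPHA, A_CM, NU, delta_s)
print(f"dS = {delta_s:.2f} cal/(mole K), exp(dS/R) = {math.exp(delta_s / R_CAL):.1f}")
print(f"D0 = {d0:.3g} cm^2/sec")
```

With inputs of this size the entropy term raises $D_0$ by a factor of the order of ten over the value obtained by neglecting $\Delta S$, which is the size of the discrepancy the theory is intended to remove.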

Inserting the known or now estimated values of $p$, $\alpha$, $\nu$ and
$a^2$ into equation (16), values of $\Delta S_{exp}$ can be calculated from
experimental values of $D_0$ and compared with the values of
$\Delta S_{theo}$ given by equation (19). For this latter calculation WERT
has used the temperature coefficients of the polycrystalline bulk shear
moduli as deduced from KÖSTER'S 64 measurements, for a large number of
materials, of the temperature dependence of Young's moduli. A comparison
of theory with experiment has been made by WERT 58 by plotting
$\Delta S_{exp}$ against $\Delta S_{theo}$ as in Fig. 9. Full agreement
with the foregoing theory is indicated for those points which lie on the
line of slope one. The most accurate experiments to date are those for C
in $\alpha$-Fe 57 in view of the very extensive temperature range (−40 to
800°C) over which they have been made, and in this case the agreement is
excellent. WERT points out that both for this and other cases, repetition
of measurements by different observers appears to lead to continuously
improving agreement between experiment and theory and so by implication
attributes remaining discrepancies to there still being room for
improvements in the accuracy of $D_0$. The arrows in Fig. 9 run from
earlier to later experiments.

Fig. 9. Theoretical and Experimental Values of $\Delta S/R$ for
Interstitial Diffusion 58
Oxygen in Ta: KÊ 110 (1948)
Nitrogen in Ta: KÊ 111 (1948)
Carbon in Ta: KÊ 111 (1948); WERT 58 (1950)
Carbon in $\alpha$-Fe: SNOEK 102 (1941); STANLEY 109 (1949);
WERT and ZENER 56 (1949); WERT 57 (1950)
Nitrogen in $\alpha$-Fe: SNOEK 102 (1941); WERT and ZENER 56 (1949);
WERT 58 (1950)

In an earlier paper, WERT and ZENER 56 consider the possibility that
not all the energy $\Delta G$ may go into straining the lattice, in which
case $\Delta S$ may be allowed to take any value between zero and that
given by equation (19). The experimental values suggest, however, that
$\Delta S$ always takes its full value.

This work on interstitial solutions represents a considerable advance
in the subject of diffusion, being the first successful theoretical
interpretation of observed values of $D_0$. Its success is due in no small
measure to the recent availability of more accurate values of $D_0$, for
previous attempts 1 in this direction have been made by use of an
equation very similar in form to number (15). The equation for $1/\tau$
was often derived simply by multiplying the probability that an atom
had the activation energy necessary to carry it over the potential
barrier, as given by the Boltzmann factor, by the number of times per
second the atom was moving in the right direction to diffuse, i.e. $\nu$.
Since the total energy of activation rather than the free energy appeared
in these expressions, the equations resulting did not contain the term
$\exp(\Delta S/R)$. But attempts were sometimes made to remove the
discrepancy between experiment and theory which thereby resulted by
taking explicitly into account the temperature dependence of the total
energy of activation. When the energies involved vary linearly with
temperature, such a procedure leads to much the same expressions
for $D_0$ and $Q$ as the above and so, with reliable data to work on and
perhaps in some cases a little more careful attention to geometric
factors, could have succeeded in relating theory with experiment.
However, it may be shown 65 to be thermodynamically quite unsound
to employ Boltzmann factors containing total energies (in the sense of
the first law of thermodynamics) which are functions of temperature;
the correct procedure in such cases is to replace the total energies
normally appearing in the exponential by the corresponding free
energies of the states to which they refer. Although the derivation of
$1/\tau$ which we have discussed is the more rigorous and much to be
preferred, the earlier simple kinetic derivations would have produced
the same results, without recourse to thermodynamically unsound
practices, had the correct form of the Boltzmann factor been employed.

2. Diffusion in Substitutional Solid Solutions and Pure Metals

It is by no means so obvious in this case by what atomic process diffusion 
occurs for even on casual reflection several possibilities occur. It 
becomes necessary then to consider these possibilities in turn and to 
choose from them, as a model upon which to base discussion, the one, 
the consequences of which can be related the more closely to experi- 
mental measurements and observations on diffusion phenomena. 

(a) Comparison of Observed and Calculated Activation Energies. The
three types of process which have been the subject of most discussion in
the past are the direct exchange mechanism, the vacancy mechanism,
and mechanisms involving atoms in interstitial positions. These were
described and discussed in PMP.1, where it was pointed out that while
the vacancy mechanism had often been preferred in the past because it
was felt that it would require a much lower energy of activation than
the other mechanisms, this feeling was only confirmed, at least for the
close packed metal copper, by HUNTINGTON and SEITZ'S 66, 67 direct
quantum mechanical calculation of the activation energy $Q$ for all
three processes. (In the particular type of interstitial mechanism chosen
for consideration by these authors, the interstitial atoms, assumed
present in very small concentration in thermodynamic equilibrium with
the lattice, diffuse by moving on to an adjacent substitutional site,
pushing the atom occupying it on to the next interstitial site. This has
been termed "Interstitialcy Diffusion" by SEITZ 15 to distinguish it from
the more usual concept of interstitial diffusion, where the atoms move
directly from one interstitial position to the next, and which would
require a larger activation energy.)

Their estimated values of Q were, 66 in kcal/mole, 400 for the 
exchange mechanism, 230 for the interstitialcy mechanism, and, with 
a more refined calculation, 67 64 for the vacancy mechanism. When 
these are compared with the experimentally observed value of ~ 50 
for self-diffusion in copper, it becomes clear that the vacancy mechanism 
is by far the most likely of the three. A recent re-estimation 68 of the 
value for the exchange mechanism reduces it to about 240 and so does 
not vitiate this conclusion. The inference is usually drawn from these
results that for all close packed lattices the vacancy mechanism
represents most closely the actual process giving rise to diffusion in them.
But the calculations allow of no similar conclusion regarding the 
mechanism in non-close packed lattices. In fact, since it is the ion core 
repulsions which play the dominant part in rejecting the possibility of 
the exchange and interstitialcy mechanisms in such a metal as copper, 
these mechanisms would be in much closer competition with the 
vacancy mechanism in calculations on less close packed metals, where 
the ion core repulsion is much reduced. 
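
The weight of the energetic argument can be made concrete by comparing Boltzmann factors. The sketch below takes the calculated activation energies quoted above for copper and evaluates, relative to the vacancy mechanism, the factor $\exp[-(Q - Q_{vac})/RT]$. The temperature of 1273 K and the supposition that all three mechanisms share the same pre-exponential factor are assumptions made purely for illustration.

```python
import math

R_KCAL = 1.987e-3  # gas constant, kcal/(mole K)

# Calculated activation energies for copper quoted in the text, kcal/mole.
Q_CALCULATED = {"exchange": 400.0, "interstitialcy": 230.0, "vacancy": 64.0}


def relative_rate(q, q_reference, temp_k):
    """Boltzmann factor of a mechanism relative to a reference mechanism,
    assuming (for illustration) equal pre-exponential factors."""
    return math.exp(-(q - q_reference) / (R_KCAL * temp_k))


TEMP_K = 1273.0  # assumed temperature, roughly 1000 C

for name, q in Q_CALCULATED.items():
    ratio = relative_rate(q, Q_CALCULATED["vacancy"], TEMP_K)
    print(f"{name:15s} Q = {q:5.0f} kcal/mole, rate relative to vacancy = {ratio:.1e}")
```

Differences of a hundred or more kcal/mole in $Q$ correspond to rate ratios so small that no plausible difference in the pre-exponential factors could compensate for them.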
For a long time serious consideration was given only to the three
above-mentioned mechanisms, but recently other mechanisms have
been suggested representing more generalized versions of these. Thus,
for example, ZENER 69 has pointed out that the simple exchange
mechanism is only a special case of what he terms "ring diffusion" in
which a number $n$ of atoms lying on a ring, so that each is a nearest
neighbour of the one before and after it, diffuse simultaneously by
synchronous movement round the ring. Fig. 10 illustrates this for
$n = 4$ in a f.c.c. and b.c.c. lattice. The exchange mechanism is clearly
the simplest case of $n = 2$. ZENER has carried out detailed calculations,
similar to those of HUNTINGTON and SEITZ, of the activation energy
required for ring-diffusion processes in copper. He shows that contrary
to what might at first sight be expected the activation energy does not
increase with $n$, but is appreciably less for $n = 4$ than for other
values. Although more atoms are involved in a 4-ring than in a simple
exchange or 2-ring process, the disturbance attending it in the
surrounding lattice, and hence the energy required for its performance, is
less. In fact, the coulomb interaction of the closed shells at the saddle
point is almost negligible and it is only the exchange interaction which
contributes significantly to the activation energy. ZENER finds this to be
~ 90 kcal, an estimate which, although very considerably nearer than those
for the simple exchange and interstitialcy mechanisms to the observed
value of ~ 50 kcal/mole, is still by no means in as close agreement with
it as the estimate of 64 for the vacancy mechanism: and such refinements
to the calculations which might be expected to bring it closer to
the experimental value would, at the same time, tend further to improve
the agreement with experiment of the estimate for vacancy diffusion. 15
There would then appear to be no very strong grounds for replacing the
vacancy by the ring mechanism as the most likely process of diffusion
in close packed metals. As we shall see shortly there is in fact stronger
evidence against the ring mechanism: but in the absence both of such
evidence and similar calculations for less close packed metals ring
diffusion must remain in their case a possible process to be considered.

Fig. 10. 4-Ring Diffusion 69
(a) f.c.c. lattice
(b) b.c.c. lattice

PANETH 70 has calculated the activation energies for diffusion by the
above-mentioned processes in the alkali metals and in this way has been
led to consider yet another process of diffusion representing a
generalization of the interstitialcy process. He found that while the
energy to form an interstitial atom in sodium is very much less than for
copper, the energetically most favourable position for the interstitial
atom is actually midway between a pair of nearest neighbours, for an atom
in such a position produces a very considerable amount of relaxation by
the neighbours and by atoms lying along the line of the neighbours.
A defect is thereby produced, termed a "crowdion", which may be
described as a row of $n$ nearest neighbour atoms compressed so as to
occupy a distance normally occupied by $n - 1$ atoms, or as PANETH
puts it, as an interstitial atom diluted over several atomic spacings.
Diffusion then occurs by propagation of the region of compression along
the direction of the crowdion and PANETH shows that a crowdion of
about 8 atoms has a much lower energy of activation for diffusion than
either the vacancy or ring mechanism. PANETH'S estimate of c . 1 
kcal/mole for sodium awaits experimental confirmation. 

By way of a generalization of the simple vacancy model we may 
mention that SEITZ 22 has discussed the possibility that diffusion may
occur by the migration of coupled pairs of vacancies and suggests that 
this may be faster than diffusion by single vacancies because of the 
lower closed shell interaction during the exchange of an atom with 
either of the vacancy pair. Such double vacancies are thought 71 to play 
a role in the changes in resistance of metals on annealing after plastic 
deformation, and may be of importance in chemical diffusion on account 
of the Kirkendall effect, as mentioned in Section I. 

(b) Evidence from the Kirkendall Effect. A necessary and important
property of the exchange or ring diffusion mechanisms is that they can
never lead to any nett displacement of atoms relative to the crystal
lattice or to any changes in volume or shape as a result of diffusion,
other than those arising from differences in the atomic sizes of the
interdiffusing species. The work of SMIGELSKAS and KIRKENDALL,
reviewed in PMP.1, and many more recent experiments designed mostly
to confirm and extend it and reviewed in Section I, have together
demonstrated that all these forbidden phenomena are, in fact, associated
with diffusion, at least in f.c.c. lattices, and so provide the most direct
evidence to date against an exchange or ring mechanism in such
structures. A Kirkendall effect has now been observed in about nine
different systems and as these are all of the face centred cubic structure
we may probably assume that a ring mechanism is most improbable in
all alloys of this structure. No Kirkendall effect experiments have yet
been performed which would assist in elucidating the mechanism of
diffusion in alloys of other structures. As shown in Section I the
vacancy mechanism does provide a satisfactory picture of all the
phenomena associated with the Kirkendall effect, and although an
interstitial mechanism might also be invoked to do the same, the
calculations referred to in the last section, with the evidence mentioned
here, provide very strong evidence for a vacancy mechanism.

(c) Miscellaneous Evidence Relating to a Vacancy Mechanism It 
has often been proposed that if diffusion occurs by vacancies then 
since the equilibrium concentration of vacancies increases rapidly with 
temperature it ought to be possible to quench a metal so as to produce a 
concentration of vacancies in excess of that corresponding to equilibrium 
at the temperature to which the material is quenched. But it appears 
from studies 15 of the problem that the normal density of dislocations in a 
metal, which can act as sources and sinks of vacancies, is, at the 
temperatures at which conventional diffusion experiments are made, 
sufficient to restore equilibrium in a time very short compared with that 
required for the measurements. The excess vacancy concentration in 
such quenched materials will not then lead to an abnormally high 
diffusion coefficient if one attempts to measure this by conventional 
means. Thus quite recently HOFFMAN and TURNBULL 29 found no
difference between the self-diffusion coefficients of silver samples 
whether these were air quenched or slowly cooled before the experiments. 

However, the recently developed anelastic methods (see Section IV)
which allow measurements of diffusion coefficients at much lower
temperatures and in shorter times than by the conventional methods,
have been used by NOWICK 72 successfully to demonstrate that $D$ is, in
fact, larger in a quenched alloy (of 29 atomic per cent Zn in Ag) than in
the same alloy more slowly cooled. The method is such, of course, that
any number of diffusion experiments may be made on the same sample
and so one may follow the rate at which the increased diffusion coefficient
falls to the normal value appropriate to slowly cooled or annealed
alloys, as the vacancy concentration returns to its equilibrium value.
In this way NOWICK estimates that $\Delta H_1$, the activation energy for
the formation of vacancies, is ~ 12.5 kcal/mole and $\Delta H_2$, that for
their exchange with atoms, ~ 20.4 kcal/mole, corresponding to the total
activation energy $Q$ for diffusion of 32.9 kcal/mole. It is interesting in
connexion with remarks in the next part of this section that
$\Delta H_1/Q$ ~ 0.4.

Thus is confirmed a necessary consequence of the supposition that 
diffusion occurs by vacancies. 
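
The arithmetic of NOWICK'S partition of $Q$, and the size of the effect a quench might produce, can be sketched as follows. The two energies are those quoted above; the quenching and measuring temperatures are hypothetical, and the estimate supposes that no vacancies are lost during the quench and that the entropy factor in the vacancy concentration cancels in the ratio.

```python
import math

R_KCAL = 1.987e-3  # gas constant, kcal/(mole K)

DH_FORMATION = 12.5   # kcal/mole, formation of vacancies (quoted in the text)
DH_MIGRATION = 20.4   # kcal/mole, exchange of vacancy with atom (quoted)

q_total = DH_FORMATION + DH_MIGRATION
formation_fraction = DH_FORMATION / q_total


def vacancy_supersaturation(t_quench_k, t_measure_k, dh_form=DH_FORMATION):
    """Ratio of the vacancy concentration retained from t_quench_k to the
    equilibrium concentration at t_measure_k (idealized quench)."""
    frozen_in = math.exp(-dh_form / (R_KCAL * t_quench_k))
    equilibrium = math.exp(-dh_form / (R_KCAL * t_measure_k))
    return frozen_in / equilibrium


# Hypothetical temperatures, chosen only to illustrate the magnitude.
T_QUENCH, T_MEASURE = 673.0, 323.0

print(f"Q = {q_total:.1f} kcal/mole, dH1/Q = {formation_fraction:.2f}")
print(f"supersaturation = {vacancy_supersaturation(T_QUENCH, T_MEASURE):.1e}")
```

Since $D$ is proportional to the vacancy concentration, a supersaturation of this kind would appear directly as an enhanced diffusion coefficient until the excess vacancies have annealed out.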

There are many properties of metals which are intimately related in
some way to their diffusion behaviour so that a study of such properties
may be of use in furthering a knowledge of the diffusion processes
themselves. By assuming that vacancies exist in the lattice and that
diffusion occurs by the migration of vacancies, many of these properties
have been successfully interpreted using the known values of diffusion
coefficients, or conversely have provided diffusion data agreeing with
that obtained by direct measurement. Such work then strengthens the
evidence in favour of the vacancy mechanism in the f.c.c. structures for
which these other properties have been successfully so interpreted.
The work on sintering and on the creep of wires will be discussed in
Section IV. These involve changes of shape at rates determined by
diffusion processes and so, like the Kirkendall effect, rule out the
possibility of an exchange or ring mechanism. It has been suggested 73, 74
that the increase in electrical resistance of metals after cold work is
due to the generation of an increased vacancy concentration by the
movement of the dislocations. MANINTVELD 71 has recently studied the
rate of return of the resistance to its normal value on annealing, a rate
determined by the migration rate of the vacancies, and has calculated
the activation energies for the movement of single vacancies in Au,
Ag and Cu to be 16, 15 and 20 kcal/mole, respectively. Although
appreciably less than half the corresponding total activation energies for
diffusion, these are not unlikely values. The same experiments provide
possible evidence for migration of pairs of vacancies, with activation
energies rather less than those for single vacancies.

There is also an increase in resistance of pure metals after neutron 
irradiation. Since such treatment can only produce vacancies or inter- 
stitials, it is most interesting that the activation energy for the recovery 
of the resistance on annealing turns out, at least for platinum, 131 to be 
equal to that for recovery of the resistance produced by cold work in 
the same metal. 

It should be mentioned that such experiments as these constitute 
"evidence" for a vacancy mechanism only because the calculations of 
HUNTINGTON and SEITZ show an interstitial mechanism to be most 
unlikely on energetic grounds. In the absence of such calculations, 
NOWICK'S experiments, for example, could as readily be explained on 
the supposition that diffusion occurs by any mechanism involving lattice 
defects whose concentration increases with temperature. 

Attempts have also been made 75, 76 to obtain evidence for a vacancy
mechanism by studying diffusion in alloys such as FeAl, NiAl and
CoAl which are known to possess a defect structure in that in certain
ranges of composition a large number of lattice sites are vacant. Thus,
for example, 77 on the cobalt rich side of approximately 50-50 atomic
per cent b.c.c. alloys of CoAl, the cobalt atoms replace aluminium, atom
for atom, but on the aluminium rich side, although the b.c.c. structure is
maintained, the number of atoms per unit cell is less than two and many
sites are in consequence vacant. If such vacancies facilitate diffusion
then a marked increase in the diffusion coefficient should occur as the
concentration passes through 50 per cent towards higher aluminium
content. Such an increase has in fact been observed by NIX and
JAUMOT 76 for the diffusion of cobalt in CoAl alloys. BURGESS and
SMOLUCHOWSKI, 75 however, found no similar increase in NiAl alloys and
attributed this to the inability of the vacant sites in such alloys to play
the role required of them in diffusion on account of their small size
compared to that of the aluminium atom.

One would expect a similar effect with no increased diffusion in the 
Co/Al alloys and so the evidence from these experiments is not very 
helpful, but it is in any case difficult to see why such experiments could 
be taken as providing evidence for a vacancy mechanism except in 
these rather special defect alloys. Any increase in diffusion in the 
concentration range where the alloys are defective may certainly be due 
to a vacancy mechanism, but this throws no light on the mechanism 
in the non-defective structures, which, being body centred cubic, is 
much needed. 

(d) Theory of $D_0$. Diffusion coefficients for self-diffusion and for
chemical diffusion in substitutional solid solution are well known to
vary with temperature through the Arrhenius type equation (12). Just
as in Section 2 (a) a comparison with experiment of the values of $Q$ to
be expected from the various proposed models for diffusion has enabled
a choice to be made between these models, so in general may we expect
it to be possible to make or confirm the choice by comparing observed
values of $D_0$ with those to be expected from the various models.

The most forbidding feature of the reported values 78 of $D_0$ for
chemical diffusion is that they cover the alarming range of $10^{-9}$ to
10 cm$^2$/sec. For self-diffusion $D_0$ lies, however, in the far more
moderate range of $10^{-1}$ to 10 cm$^2$/sec and so for this reason and
because it is the simplest possible type of diffusion in metals, it is
natural that attempts should first be made to understand the values of
$D_0$ associated with it.

A careful reassessment 79, 80 of the reported work on self-diffusion
measurements has provided what are, on the evidence available, the
most likely values of $D_0$ and $Q$ for the f.c.c. metals Au, Ag, Pb and
Cu. An interpretation 79, 81 of these data by a method which is an
extension of that used so successfully by WERT and ZENER 56-58 for
interstitial diffusion has shown that the values of $D_0$ for
self-diffusion in these metals are consistent only with, and, therefore,
confirm the operation of, the vacancy model of diffusion. The same methods
applied to the b.c.c. metals suggest that self-diffusion occurs in them by
a 4-ring diffusion process.

(i) Self Diffusion. Equation (13) for $D$ holds equally well for
vacancy diffusion provided it can be assumed that successive jumps of
any individual atom are random in direction and independent of one
another. This assumption, valid certainly for dilute interstitial solid
solution, is not precisely valid for vacancy diffusion, for an atom which
has just exchanged with a vacancy will be more likely to make its next
move back to its original position rather than in any other direction.
The error involved in making it (~ 10 per cent) 20 is well within the
error of most experiments and so at present unimportant. $\alpha$ in
equation (13) is readily shown to be 1/12 for an f.c.c. 105 and 1/8 for a
b.c.c. lattice.

If $\beta$ is the fractional concentration of vacancies, then in an f.c.c.
lattice $12\beta$ is the probability that an atom has a vacant site as
neighbour. Only then, of course, may the atom move.

If $n'$ is the rate of jumping of an atom into an adjacent vacant site,
then clearly

$$1/\tau = 12\beta n' \qquad (20)$$

where $\tau$ is as previously defined (p. 295). Similarly for a b.c.c.
lattice

$$1/\tau = 8\beta n' \qquad (21)$$

If $\Delta G_1$ is the change in free energy on forming a vacancy at
constant temperature and pressure, apart from that contributed by changes
in mixing entropy, then the equilibrium concentration of vacancies is
readily shown to be

$$\beta = \exp(-\Delta G_1/RT) \qquad (22)$$

$n'$ may be evaluated by methods identical with those used by WERT 57
for interstitial diffusion, when we find

$$n' = \nu \exp(-\Delta G_2/RT) \qquad (23)$$

where v is the frequency of vibration of an atom on a lattice site and 
AC? 2 is an amount of isothermal work defined exactly as on p. 296. 
Using the same thermodynamic methods and relations as for the
interstitial case we find from equations (13) and (20) to (23) that for
self diffusion in both b.c.c. and f.c.c. lattices,

$D = a^2 \nu \exp[(\Delta S_1 + \Delta S_2)/R] \exp[-(\Delta H_1 + \Delta H_2)/RT]$ .... (24)

where $\Delta S_1$ and $\Delta H_1$ are the entropy and enthalpy associated with the
free energy $\Delta G_1$, and similarly for $\Delta S_2$ and $\Delta H_2$. $(\Delta H_1 + \Delta H_2)$
corresponds to $Q$, the observed activation energy, and the constant term
$D_0$ is given by

$D_0 = a^2 \nu \exp[(\Delta S_1 + \Delta S_2)/R]$ .... (25)
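Equations (20) to (25) can be checked numerically. The following is a minimal sketch, assuming that equation (13) has the form $D = \alpha a^2/\tau$ and using purely hypothetical values for the lattice parameter, the vibration frequency and the two free energies (none of these numbers comes from the text). It illustrates that the number of neighbours cancels against $\alpha$, so that the same expression (24) results for both lattices.

```python
import math

R = 1.987  # gas constant, cal/(mole K)

# alpha of equation (13) and the number of nearest neighbours z
LATTICES = {"fcc": (1.0 / 12.0, 12), "bcc": (1.0 / 8.0, 8)}

def vacancy_fraction(dG1, T):
    """Equation (22): equilibrium fractional vacancy concentration beta."""
    return math.exp(-dG1 / (R * T))

def jump_rate(nu, dG2, T):
    """Equation (23): rate n' of jumping into an adjacent vacant site."""
    return nu * math.exp(-dG2 / (R * T))

def diffusion_coefficient(lattice, a, nu, dG1, dG2, T):
    """Assumed form of equation (13), D = alpha a^2 / tau, with 1/tau from (20) or (21)."""
    alpha, z = LATTICES[lattice]
    inv_tau = z * vacancy_fraction(dG1, T) * jump_rate(nu, dG2, T)
    return alpha * a ** 2 * inv_tau

def d_from_eq24(a, nu, dS, Q, T):
    """Equation (24) with dS = dS1 + dS2 and Q = dH1 + dH2."""
    return a ** 2 * nu * math.exp(dS / R) * math.exp(-Q / (R * T))

# Hypothetical inputs, for illustration only.
a, nu, T = 3.6e-8, 7.0e12, 1200.0        # cm, 1/sec, K
dH1, dS1 = 20000.0, 2.0                  # cal/mole, cal/(mole K)
dH2, dS2 = 26000.0, 3.0
dG1, dG2 = dH1 - T * dS1, dH2 - T * dS2  # dG = dH - T dS

D_fcc = diffusion_coefficient("fcc", a, nu, dG1, dG2, T)
D_bcc = diffusion_coefficient("bcc", a, nu, dG1, dG2, T)
D_24 = d_from_eq24(a, nu, dS1 + dS2, dH1 + dH2, T)
print(D_fcc, D_bcc, D_24)
```

The three printed values coincide, which is the content of the statement that equation (24) holds for both b.c.c. and f.c.c. lattices.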



As before, if the work $\Delta G_2$ appears as strain energy in the lattice, $\Delta S_2$
will be positive. $\Delta S_1$ will also 81 be positive, for when a vacancy is
formed in the lattice the neighbouring atoms acquire a greater freedom
of movement so that their entropy must increase. Thus the total entropy
of activation $(\Delta S_1 + \Delta S_2) = \Delta S$ must be positive. ZENER 81 points
out that this is, in fact, true for self-diffusion in all cubic metals.

As we have seen, the calculations of HUNTINGTON and SEITZ provide
values of $\Delta H_1$ and $\Delta H_2$ agreeing quite closely with experiment, but no
such corresponding calculations, necessary to complete the theoretical
interpretation of $D_0$, have yet been made for $\Delta S_1 = -\partial \Delta G_1/\partial T$ and
$\Delta S_2 = -\partial \Delta G_2/\partial T$. Very rough estimates have from time to time been
made (see PMP.1) and do produce values of $D_0$ in order of magnitude
agreement with experiment. An exact calculation of $\Delta S_1$ and $\Delta S_2$
would be very difficult, but we may, as in the interstitial case, at least
obtain an expression for $\Delta S$ in terms of $\Delta H$, although the analysis is
more complicated here by there being two terms in the activation
energy. However, we proceed in a similar way and propose a correlation
between both $\Delta G_1$ and $\Delta G_2$ and other physical properties of the lattice
to which we might expect these quantities to be most closely related.
Thus we write again equation (17) for $\Delta G_2$. For geometrically similar
lattices $\varepsilon$ will be the same and we may suppose $v$ to be proportional to
the atomic volume $M/\rho$, where $M$ is the atomic weight and $\rho$ the
density, so that we may write

$\Delta G_2 = k_2 M \mu/\rho$ .... (26)

where $k_2$ is a constant for metals of one structure. $\Delta H_1$ is the change in
internal energy on forming a vacancy and it has often been proposed
that this will be closely related to the latent heat of sublimation $L_s$, so
we put 82

$\Delta H_1 = k_1 L_s$ .... (27)

Assuming $\mu$ and $\rho$ to vary linearly with temperature we then find,
using the equation $\Delta S = -\partial \Delta G/\partial T$, that

. . . . (28) 
. . . . (29) 

where the subscript 0 refers to the constant term in the linear expressions
for $\mu$ and $\rho$ as in equation (18) for $\mu$, and the dash denotes differentiation
with respect to temperature. The second term in (29) is negative,
so that $\Delta S$ remains positive.

We may eliminate $k_2$ from (29) using (28) and so find for $\Delta S$

... (30) 
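Equations (28) to (30) are not legible in this copy. The following is one possible reconstruction from equations (26) and (27), offered as a sketch and not as the printed form. It assumes the linear expressions $\mu = \mu_0[1 - (\mu/\mu_0)'T]$ and $\rho = \rho_0[1 - (\rho/\rho_0)'T]$, with the dashed quantities taken as positive fractional temperature coefficients (a sign convention chosen here so that the result agrees in sign with equations (31) and (33)), and it keeps only terms of first order in $T$.

```latex
\begin{align*}
\Delta G_2 &= k_2 M \frac{\mu}{\rho}
  \approx k_2 M \frac{\mu_0}{\rho_0}
  \left[1 - \left\{(\mu/\mu_0)' - (\rho/\rho_0)'\right\} T\right], \\
\Delta S_2 &= -\frac{\partial \Delta G_2}{\partial T}
  = k_2 M \frac{\mu_0}{\rho_0}\left\{(\mu/\mu_0)' - (\rho/\rho_0)'\right\}, \qquad
\Delta H_2 = \Delta G_2 + T\,\Delta S_2 = k_2 M \frac{\mu_0}{\rho_0}, \\
Q &= \Delta H_1 + \Delta H_2 = k_1 L_s + k_2 M \frac{\mu_0}{\rho_0}, \\
\Delta S &= \Delta S_1 + k_2 M \frac{\mu_0}{\rho_0}\left\{(\mu/\mu_0)' - (\rho/\rho_0)'\right\}, \\
\Delta S &= \Delta S_1 + (Q - k_1 L_s)\left\{(\mu/\mu_0)' - (\rho/\rho_0)'\right\}.
\end{align*}
```

On this reading the third, fourth and fifth lines would play the parts of equations (28), (29) and (30) respectively. The $(\rho/\rho_0)'$ term enters the braces negatively, which would be the negative second term referred to in the text.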



The experimental value of $\Delta S$ is given by $R \log (D_0/a^2\nu)$ so $k_1$ may be
calculated for each of the cubic metals for which $D_0$ and $Q$ are known
by inserting in equation (30) appropriate values for the other quantities.
$\Delta S_1$, the change in entropy on forming a vacancy, arises principally from
the lowering of the vibrational frequency of the atoms surrounding
the vacant site, but a proper calculation of this frequency change does
not appear to have been made. It is to be noted, however, that the local
values near a vacant site, certainly of $\mu$ and most probably of $(\mu/\mu_0)'$,
will be less than the bulk values, and these changes in elastic properties
and in $\nu$ will be closely related. It is the local values of $(\mu/\mu_0)'$ which
should of course be used in (30). But since $k_1$ derived from equation (30)
decreases with decreasing $\Delta S_1$ and increases with increasing $(\mu/\mu_0)'$ we
shall assume for simplicity that $k_1$ is unaffected if we put $\Delta S_1 = 0$ in
equation (30) and employ in the calculations the bulk value of $(\mu/\mu_0)'$.
For $\nu$ we may take the Debye frequency. Using for $(\mu/\mu_0)'$ the same
source of data as employed in Part 1 of this section, it is found 79
that $k_1$ is very nearly the same for the four f.c.c. metals for which
all the relevant data exist, as shown in Table 2.

TABLE 2

                      Pb 84    Ag 29, 31   Au 83    Cu 80    $\alpha$-Fe 46   W 64

$D_0$, cm$^2$/sec     4.02     0.895       0.157    0.178    5.8              11.5
$Q$, kcal/mole        27.5     45.9        53.0     46.5     59.7             142
$k_1$                 0.21     0.21        0.20     0.23     -0.075           -0.147
$k_2$                 0.23     0.22        0.25     0.18

Mean $k_1$ = 0.21 and mean $k_2$ = 0.22 (f.c.c. metals).

The constancy of $k_1$ for the f.c.c. metals justifies the correlations
(26) and (27) and the assumption of a vacancy mechanism on which
they were based. The mean value of $k_1$ gives for Cu, for example,
17.4 kcal/mole as the energy required to form a vacancy. This, being
about 0.4 of the total activation energy, is a very reasonable value,
although a little lower than the value of $0.64Q$ estimated by HUNTINGTON
and SEITZ. 66, 67 But since both this and the present estimate are
only approximate, the discrepancy does not invalidate the present
analysis.
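The figures quoted for copper can be checked by simple arithmetic. The sketch below uses only values from Table 2 and the accompanying text ($Q$ = 46.5 kcal/mole, mean $k_1$ = 0.21, vacancy energy 17.4 kcal/mole, and the HUNTINGTON and SEITZ figure read here as $0.64Q$). The latent heat of sublimation it prints is back-calculated from equation (27) and is not a value given in the text.

```python
# Values for Cu taken from Table 2 and the accompanying text.
Q_CU = 46.5        # kcal/mole, observed activation energy
K1_MEAN = 0.21     # mean k1 for the f.c.c. metals
E_VACANCY = 17.4   # kcal/mole, energy to form a vacancy (k1 * Ls)

# Fraction of the activation energy spent on forming the vacancy.
fraction = E_VACANCY / Q_CU

# Latent heat of sublimation implied by equation (27), dH1 = k1 * Ls.
# Back-calculated here; not a figure quoted in the text.
Ls_implied = E_VACANCY / K1_MEAN

# Vacancy energy on the estimate of 0.64 Q.
E_vacancy_hs = 0.64 * Q_CU

print(f"vacancy fraction of Q: {fraction:.2f}")
print(f"implied Ls:            {Ls_implied:.1f} kcal/mole")
print(f"0.64 Q:                {E_vacancy_hs:.1f} kcal/mole")
```

The fraction comes out somewhat below 0.4, in line with the rounded figure given in the text.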

However, we see from Table 2 that the values of $k_1$ for the b.c.c.
metals are not only well removed from those for f.c.c. but are in addition
negative; a negative $k_1$ in equation (27) is meaningless. This may
indicate inaccuracies in the measurements for W (see also p. 295) and
$\alpha$-Fe or that in their case a vacancy mechanism of diffusion does not
operate. It is very interesting that the observed values of $D_0$ and $Q$ for
these two metals can be satisfactorily related only on the assumption
that diffusion occurs in them by a 4-ring exchange mechanism.

The diffusion coefficient for self-diffusion by a ring mechanism may
be shown 81 to be given by

$D = \gamma \nu a^2 \exp(-\Delta F/RT)$

where $\gamma$ is an integer always greater than 1 and depending on the
number of the ring and upon the type of lattice. $\Delta F$ is the free energy
of activation for the ring diffusion process and, as before, we may put
$\Delta F$ proportional to $M\mu/\rho$, as in equation (26), and so find

$\Delta S_{ring} = Q\{(\mu/\mu_0)' - (\rho/\rho_0)'\} = R \log (D_0/\gamma a^2 \nu)$

Thus $\gamma$ may be calculated from the equation

$R \log \gamma = R \log (D_0/a^2\nu) - Q\{(\mu/\mu_0)' - (\rho/\rho_0)'\}$ .... (31)

and the values turn out, for f.c.c. lattices, to be not only far from
constant but all small fractions inconsistent with the supposition of a
ring mechanism. But for W and $\alpha$-Fe they are greater than 1 and both
closer to the value of $\gamma$ (6) for a 4-ring diffusion process than for any
other numbered ring.

An analysis similar to the above, with the same type of assumptions, 
has been carried out 79 for interstitial diffusion but fails to provide values 
of $\Delta S$ and, therefore, $D_0$, which are consistent with experiment either
for b.c.c. or f.c.c. metals.

The methods outlined here, which avoid any direct calculation of $D_0$,
show that observed values of it may be satisfactorily interpreted on the
basis only of a vacancy mechanism for f.c.c. metals and of a 4-ring 
mechanism for b.c.c. metals. Both results are quite consistent with 
previous evidence on the mechanism of diffusion in these structures. 

Equation (30) may be written in a simpler form by noting, as was
pointed out by STEIGMAN, SHOCKLEY and NIX 119 some time ago (see
also ref. 31), that the ratio $Q/L_s$ is fairly constant from metal to metal
and of mean value about 0.6. Putting $L_s = Q/0.6$ in equation (30)
we find

. . . . (32)

If further we ignore $(\rho/\rho_0)'$ in the above analysis, as being smaller
than $(\mu/\mu_0)'$, and recalculate a mean value of $k_1$ (the separate values of
$k_1$ turn out to be not quite so constant as in Table 2), we find that

$\Delta S = 0.55 Q (\mu/\mu_0)'$ .... (33)

which is exactly the result obtained by ZENER 81 who plotted $\log (D_0/a^2\nu)$
against $Q(\mu/\mu_0)'/R$ and found the points for f.c.c. metals to be fairly
close to a line of slope 0.55. ZENER, however, provides no clear understanding
of the significance of the numerical factor. As would be
expected from the foregoing, the points for b.c.c. metals lie well above
the line of slope 0.55 and ZENER interpreted this as probable evidence
for a ring diffusion process.

With the now known value of $k_1$, $k_2$ can be calculated from equation
(29) or preferably equation (28). An examination of the displacements
of its neighbours during the diffusion of an atom shows that these
correspond crudely, when resolved in two directions at right angles, to
atomic movements associated with shears for which $C_{44}$ and $(C_{11} - C_{12})/2$
are the appropriate moduli. Using the sum of these as a rough estimate
of the appropriate modulus in equation (28), $k_2$ is found 79 to be fairly
constant, as shown in Table 2. Ideally, of course, the temperature
coefficient of $(C_{44} + (C_{11} - C_{12})/2)$ should be used in equation (30) in
determining $k_1$, but the necessary measurements do not appear to have
been made. We might expect the value of $(\mu/\mu_0)'$ to be less sensitive
to the type of modulus chosen than is $\mu$ itself, so that the calculation of
$k_1$ may not be seriously affected by the modulus we employed not being
the most appropriate. $k_2$, which requires a knowledge of $\mu$ and $(\mu/\mu_0)'$,
may be more sensitive to choice of correct modulus and a more careful
study of the problem may reveal a modulus which gives more constant
values of $k_2$ than does the one we have used.

Equations (28) and (29) provide a useful working rule for estimating 
those self diffusion coefficients of metals which have not yet been 
directly measured. 

(ii) Chemical Diffusion. It might be assumed that equation (24)
applies equally well, just as it stands, to diffusion of the solute in a
dilute solid solution, where the mutual interaction of solute atoms may
be neglected. However, equation (24) is based on the assumption that
successive moves of a diffusing atom are independent of one another
and for chemical diffusion such independence does not necessarily hold.
It does not hold, for example, when diffusion occurs by migration of
solute atom-vacancy molecules, as JOHNSON 86 suggested would take
place if vacancies were more readily formed near to solute atoms than
elsewhere. However, ZENER has analysed this case and concludes that
equation (24), unchanged, is still applicable. But a more extensive
investigation of other possible situations would be very welcome.
For example, if, when vacancies are formed equally readily on all
sites so as to be distributed at random in the lattice, a vacancy exchanges
much more readily with a solute atom than with a solvent atom,
then when a vacancy meets a solute atom it will exchange with
it a large number of times before moving away again: successive moves
of a solute atom are not then independent of one another and an analysis
of this situation 86 suggests that the equation should be multiplied by
some factor of the order of magnitude of the co-ordination number.

However, if we calculate values of $\Delta S$, using the equation
$\Delta S = R \log (D_0/a^2\nu)$, from the extensive available data on diffusion of
solute atoms in dilute solid solution, we find 81 the great majority of them
are negative and often numerically very large. It seems unlikely that
refinements to equation (24) would appreciably change this finding and,
in fact, the one we have mentioned would produce even more negative
values of $\Delta S$. Now we have seen that negative values of $\Delta S$ are
inadmissible so it appears that very many reported values of $D_0$ for
chemical diffusion in dilute solid solutions are smaller than can be
accounted for by an extension of the theory which so successfully
accounts for values of $D_0$ in self diffusion.

It is important to note that, very often, the lower the experimental
values of $D_0$ for the diffusion of a series of solutes in the same solvent
the lower also is the value of $Q$, relative to the self-diffusion activation
energy of the pure solvent. If the unaccountably low values of $D_0$ are
low because of experimental errors in the measurements, this correlation
with $Q$ values suggests that such errors may arise from the use of polycrystalline
material in the measurements. Grain boundary diffusion
processes, by contributing appreciably to the low temperature
measurements of $D$ and so raising them above the true lattice diffusion
values, will lead to abnormally low values of $Q$ and at the same time of $D_0$, as
discussed more fully on page 286, and these errors will be related to
one another because both arise from the same cause. We may obtain a
rough estimate of this relation as follows. Values of $D$ are most likely
to represent true lattice diffusion at high temperatures and certainly
near the melting point $T_M$. We may, therefore, assume that the
straight line representing $\log D$ for true lattice diffusion plotted against
$1/T$ may be obtained from the "best straight line" drawn through
experimentally obtained high and low temperature values of $D$ by a
clockwise rotation about the point where it passes through $T_M$. It is
then readily shown that $\log D_0$ and $Q$ derived from the experimental
straight line will differ from the true lattice diffusion values by amounts
$\Delta \log D_0$ and $\Delta Q$, which are related by

$\Delta \log (D_0/a^2\nu) = \Delta Q/RT_M$ .... (34)
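Relation (34) follows in a few lines from the construction just described. The derivation below is a sketch in which log is taken as the natural logarithm and the superscript "exp" (a label introduced here, not in the original) marks quantities read from the experimental straight line.

```latex
\begin{align*}
\log D &= \log D_0 - \frac{Q}{RT}
  && \text{(true lattice line)} \\
\log D^{\mathrm{exp}} &= \log D_0^{\mathrm{exp}} - \frac{Q^{\mathrm{exp}}}{RT}
  && \text{(experimental line)} \\
\log D_0 - \frac{Q}{RT_M} &= \log D_0^{\mathrm{exp}} - \frac{Q^{\mathrm{exp}}}{RT_M}
  && \text{(the lines meet at } T = T_M\text{)} \\
\Delta \log D_0 \equiv \log D_0 - \log D_0^{\mathrm{exp}}
  &= \frac{Q - Q^{\mathrm{exp}}}{RT_M} = \frac{\Delta Q}{RT_M}
\end{align*}
```

Since $a^2\nu$ is the same for both lines, $\Delta \log (D_0/a^2\nu) = \Delta \log D_0$, which gives the form quoted.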

Relation (34) is most interesting in connexion with a correlation
observed by DIENES 87, 127 between $D_0$ and $Q$. DIENES showed that if
experimental values of $\log (D_0/a^2\nu)$ for both self diffusion and chemical
diffusion in dilute solutions are plotted against $Q/RT_M$ the points all
lie, although roughly, along a line of slope one and so of equation

$\log (D_0/a^2\nu) = Q/RT_M - \text{constant}$ .... (35)

It can be shown 79 that because of a loose correlation between $(\mu/\mu_0)'$
and $1/T_M$ this equation may be derived with the same value of the
constant as a very rough approximation to our previous expression for
$\log (D_0/a^2\nu)$: but our theory does not allow the negative values of
$R \log (D_0/a^2\nu) = \Delta S$, which this equation claims to cover. It is clear
then why the experimental values of $D_0$ and $Q$ for self-diffusion lie
near the line of equation (35). The very extensive range of values of
$D_0/a^2\nu$, covering about nine cycles of ten, which are included in the
correlation, tends to mask the fact that many points lie away from the
line by amounts which correspond to errors of a factor as great as 10 in
the values of $D_0/a^2\nu$. If then we take equation (24) as valid for chemical
diffusion in dilute solid solution and assume that, say, equation (33)
even approximately represents the value of $\Delta S$ for chemical diffusion,
then since $\Delta S = R \log (D_0/a^2\nu)$ is proportional to $Q$ and since further
$(\mu/\mu_0)'$ for the alloy may not differ much from that for the pure solvent,
we might expect the true values of $D_0$ to give points fairly close to
DIENES' line of slope 1. But even if the experimental values of $D_0$ and $Q$
are in error, for reasons such as mentioned above, so that these errors
are related by equation (34), then by virtue of this equation they will
continue to lie near the line of slope 1 when plotted as $\log (D_0/a^2\nu)$
against $Q/RT_M$, but in a region removed from that corresponding to the
true values of $D_0$ and $Q$ by an amount which is larger the larger the
errors in $D_0$ and $Q$.

The theory of chemical diffusion even in dilute solid solution is by
no means complete and the above discussion of it in consequence very
approximate, but it does appear from this work that many of the
reported values of $D_0$ and $Q$ are not truly representative of lattice
diffusion. That $D_0$ and $Q$ are roughly correlated in the way shown by
DIENES does not necessarily mean that all values of $D_0$ and $Q$ are at
least approximately correct and that the correlation is a true consequence
of lattice diffusion, for it is as readily explained if the departures
of $D_0$ and $Q$ from the values for true lattice diffusion are related in a
way approximately represented by equation (34). One such way, as we
have seen, is through the operation of grain boundary diffusion processes,
but any effects which produce enhanced overall diffusion at low temperatures,
by providing what ZENER 81 calls "short circuiting paths,"
with a lesser tendency to do so at high temperatures, will have a similar
influence on reported values of $D_0$ and $Q$. Many of the complicating
features associated with the Kirkendall effect are just such as might
be more important at low temperatures. Thus, for example, the lattice
distortion accompanying the shrinkage and expansion which occurs in
chemical diffusion, and which will favour increased diffusion rates, will
persist more readily at low than at high temperatures. Both Kirkendall
effects, through the influence of these features, and grain boundary effects
in polycrystalline material, occurring even in the absence of a Kirkendall
effect, may lead to values of $Q$ and $D_0$ which do not represent true
lattice diffusion. The latter may be avoided by using single crystal
material or, if polycrystalline, by working over a temperature range
such that, for the grain sizes used, grain boundary diffusion is negligible,
but the Kirkendall effects are to be avoided only by working with very
low concentration gradients, or with zero gradients as in chemical
self-diffusion studies.

Evidence that these considerations may be quite important is
provided by recent measurements at extremely low concentrations of
Au (0.001 per cent) in Cu 88 and of Co in Ni, 89 for these both yield values
of $D_0$ which correspond to positive values of $\Delta S$, even agreeing roughly
with the theoretical values given by equation (33). This is very far from
true for the many dilute solution values of $D_0$ for other elements
diffusing in Cu which have been measured with larger concentration
gradients. The few results of chemical self-diffusion coefficient measurements
and of measurements of diffusion coefficients by the anelastic
methods discussed in Section IV, both of which employ chemically
homogeneous materials, also lead to positive values of $\Delta S$. It is also of
great interest that $Q$ for Au in Cu is quite close to that for self-diffusion
in Cu, in contradistinction to a general result that $Q$ for low concentration
solute diffusion in a solvent is always appreciably less than that for
self-diffusion in the pure solvent (see PMP.1). This one result throws
doubt on this commonly accepted generalization, which was the basis of
JOHNSON'S 86 solute atom-vacancy "molecule" diffusion mechanism.
Further investigation may possibly show it to have been a consequence
of Kirkendall or concentration gradient effects, absent in self-diffusion,
introducing errors into chemical diffusion measurements even at quite
small concentrations of a few per cent.

Many of these matters have been discussed at length by NOWICK, 90
who also makes use of the foregoing theory of $D_0$ to calculate lattice
values for $D_0$ and $Q$ from those reported measurements of $D$ in various
low concentration binary systems which were made at the highest
temperature used in the investigations, and so are most likely to
represent true lattice diffusion behaviour. Using ZENER'S approximate
equation (33) we have for $D$

$D = \nu a^2 \exp(0.55 Q (\mu/\mu_0)'/R) \exp(-Q/RT)$ .... (36)

Hence a single high temperature value of $D$ serves to provide a value
for $Q$ which, as NOWICK points out, even if equation (36) is correct only
to an order of magnitude, will be more accurate than that obtained from a
conventional plot of $\log D$ against $1/T$, for such plots often yield values of $D_0$
which appear to be in error by factors as great as $10^8$. In this way,
using values of $(\mu/\mu_0)'$ for the pure solvent, he finds that the recalculated
activation energies for the diffusion of a series of solutes in the same
solvent are, in fact, all much closer (within ~ 15 per cent) to that for
self-diffusion in the pure solvent than the reported values. Also the
solute activation energies appear often to be greater than the solvent
self-diffusion activation energies for those solutes which according to
KOSTER and RAUSCHER 91 increase the elastic modulus of the pure
solvent and to be less for those solutes which decrease the modulus, as
might be expected from equation (28). From these values of $Q$ NOWICK
also calculated the corresponding values of $D_0$ from equation (33) but
these are much more sensitive to any inaccuracies in equation (33) than
are the $Q$ to inaccuracies in equation (36), as the putting of appropriate
quantities into this equation when solved for $Q$ will show.
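The single-point procedure attributed to NOWICK can be illustrated by solving equation (36) for $Q$. The following is a minimal sketch under stated assumptions: the lattice parameter, frequency, value of $(\mu/\mu_0)'$ (taken positive, as the equation is printed) and the "measured" $D$ are all hypothetical, and the function names are introduced here for illustration only. The second half perturbs the numerical factor 0.55 to show how differently $Q$ and $D_0$ respond.

```python
import math

R = 1.987  # gas constant, cal/(mole K)

def d0_eq33(Q, a, nu, mu_coeff, lam=0.55):
    """D0 = nu a^2 exp(dS / R), with dS = lam * Q * (mu/mu0)' as in equation (33)."""
    return nu * a ** 2 * math.exp(lam * Q * mu_coeff / R)

def d_eq36(Q, T, a, nu, mu_coeff, lam=0.55):
    """Equation (36): D = D0 exp(-Q / RT)."""
    return d0_eq33(Q, a, nu, mu_coeff, lam) * math.exp(-Q / (R * T))

def q_from_single_d(D, T, a, nu, mu_coeff, lam=0.55):
    """Equation (36) solved for Q from one value of D at temperature T."""
    return R * math.log(D / (nu * a ** 2)) / (lam * mu_coeff - 1.0 / T)

# Hypothetical inputs, for illustration only.
a, nu, mu_coeff = 3.6e-8, 7.0e12, 3.5e-4   # cm, 1/sec, 1/K
T_high, Q_true = 1300.0, 46500.0           # K, cal/mole

# Generate a "measurement" from equation (36), then recover Q from it.
D_measured = d_eq36(Q_true, T_high, a, nu, mu_coeff)
Q_rec = q_from_single_d(D_measured, T_high, a, nu, mu_coeff)

# Sensitivity: suppose the factor 0.55 were really 0.66.
lam_off = 0.66
Q_off = q_from_single_d(D_measured, T_high, a, nu, mu_coeff, lam_off)
D0_ref = d0_eq33(Q_rec, a, nu, mu_coeff)
D0_off = d0_eq33(Q_off, a, nu, mu_coeff, lam_off)
rel_change_Q = abs(Q_off - Q_rec) / Q_rec
rel_change_D0 = abs(D0_off - D0_ref) / D0_ref
print(Q_rec, rel_change_Q, rel_change_D0)
```

With these inputs the relative change in $D_0$ is much larger than that in $Q$, which is the point made in the text about the relative sensitivities of the two quantities.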

Little has been said about the applicability to chemical diffusion of
the theoretical expressions for $\Delta S$ we have discussed for self diffusion.
While equation (30) may be formally applicable it is, particularly in the
absence of a clearer knowledge of the mechanism of chemical diffusion,
more difficult to know what values to insert for the various quantities
it contains. Such matters, and to what extent equation (33) is still valid
as an approximation to (30), remain for further study, but it is to be
noted that when experimental measurements lead to positive values of
$\Delta S$, these agree, at least within an order of magnitude, with the values
given by equation (33).

SECTION IV 

EXPERIMENTAL METHODS 

The various conventional experimental methods by which chemical 
and self-diffusion coefficients in the various metals and alloys have been 
measured have been adequately reviewed elsewhere, 1, 50, 55, 92 together
with the mathematical equipment required for their interpretation. 
These generally require the measurement of the rate at which an 
inhomogeneity in chemical composition, or in content of radioactive 
atoms, is changed by the process of diffusion and a compilation of the 
data accumulated by these methods has recently been published in 
SMITHELLS' Metals Reference Book. 78 It is to be emphasised, however, 
that much of this is probably of uncertain theoretical significance for 
many reasons, discussed either in PMP.l or in the present chapter. 

That diffusion does take place can only mean that single atoms can 
migrate from one site in the lattice to another and in theoretical inter- 
pretations of diffusion coefficients the rate of this migration is regarded 
as the fundamental quantity. But any property of a metal which can 
be considered to depend upon the migration of atoms may in principle 
be employed to measure the rate of migration and so to provide informa- 
tion on diffusion. 

Thus, for example, ALEXANDER, DAWSON and KLING 93 have studied
the variation with temperature of the creep rate of gold wires under load
and, assuming this to occur by the vacancy mechanism of creep suggested
by NABARRO, 16 have calculated from the results a value of 48 kcal/mole
for the activation energy for gold self-diffusion. Using a more refined
theory due to HERRING 94 they obtain a value of 50 kcal/mole, agreeing
better with the directly measured value of 53. 83

But two properties in particular have been made use of recently to
determine diffusion coefficients and these provide several advantages
over the conventional methods of measurement. These are certain
types of internal friction in metals and their ability to sinter.

1. Sintering Method for Measuring Self-diffusion Coefficients

KUCZYNSKI 95-98 and others have studied the rate of sintering of
spherical particles to a flat plate of the same material or of a fine wire
to the surface of a cylinder on which it is wound, when the two become
bonded together in a way shown schematically in section in Fig. 11.

[Fig. 11. Section of Wire or Spherical Particle sintered to flat surface (Schematic)]

Various processes may be considered as responsible for the obvious
transfer of material that takes place. If this is to be by volume diffusion
then KUCZYNSKI 95 supposes that, just as the equilibrium vapour pressure
over a convex surface of a liquid is greater than that over a plane surface
so, assuming a vacancy mechanism of diffusion, is the equilibrium vacancy
concentration over the highly convex surface just inside the neck or bond
in Fig. 11 greater than that prevailing elsewhere. This difference in
concentration is such as to cause vacancies to flow away from the neck and,
therefore, atoms into it, enabling the bond to grow in
width. With such a model for the growth of the bond between particle
or wire and plate KUCZYNSKI obtains expressions from which the
self-diffusion coefficient can be calculated. For a particle bonding to a plate

$D = kTx^5/40\gamma\delta^3 t a^2$ .... (37)

and for a wire bonding to a plate or cylinder of large radius 

$D = 3kTx^5/80\pi\gamma\delta^3 t a^2$ .... (38)

where $k$ is Boltzmann's constant, $T$ the absolute temperature of anneal,
$\gamma$ the surface energy of the metal, $\delta$ the interatomic distance, and $a$ the
radius of the particle or wire. The experiments require a measurement
of the width $2x$ of the bond formed after a time of anneal $t$. This is
done by microscopic measurements on samples suitably mounted and
sectioned. The necessary dependence of $x^5$ on $t$ demanded by equations
(37) and (38) has been confirmed, at least for not too small sized particles
(> 15 $\mu$), by experiments on silver 95, 97 and on copper 95, 96, 98 and the
measurements used to calculate the lattice self-diffusion coefficients for
these two metals.
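Equation (37), read here as $D = kTx^5/40\gamma\delta^3 t a^2$, is simple enough to evaluate directly. The following is a minimal sketch in CGS units; the surface energy, interatomic distance, particle radius, anneal time and neck half-width are hypothetical values chosen only to exercise the formula, and the function names are introduced here for illustration.

```python
import math

K_BOLTZMANN = 1.380649e-16  # erg/K (CGS units throughout)

def d_particle_on_plate(x, t, a, T, gamma, delta):
    """Equation (37): D = k T x^5 / (40 gamma delta^3 t a^2)."""
    return K_BOLTZMANN * T * x ** 5 / (40.0 * gamma * delta ** 3 * t * a ** 2)

def neck_half_width(D, t, a, T, gamma, delta):
    """Equation (37) solved for x, showing x^5 proportional to t."""
    x5 = 40.0 * gamma * delta ** 3 * D * t * a ** 2 / (K_BOLTZMANN * T)
    return x5 ** 0.2

# Hypothetical measurement, for illustration only.
T = 1000.0        # K, temperature of anneal
gamma = 1200.0    # erg/cm^2, assumed surface energy
delta = 2.9e-8    # cm, assumed interatomic distance
a = 5.0e-3        # cm, particle radius
t = 3600.0        # sec, time of anneal
x = 5.0e-4        # cm, half the measured bond width 2x

D = d_particle_on_plate(x, t, a, T, gamma, delta)
x_back = neck_half_width(D, t, a, T, gamma, delta)
x_later = neck_half_width(D, 32.0 * t, a, T, gamma, delta)
print(D, x_back, x_later)
```

Because $x^5$ is proportional to $t$, annealing 32 times as long only doubles the bond width, and a small error in $x$ is magnified fivefold (in relative terms) in $D$.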

Since $x^5$ is proportional to $t$ it is important to obtain an accurate
value for $2x$. But the present methods of experiment seem to give a
considerable spread of values for $2x$ from particle to particle or in
different parts of the wire when this technique is used, and a mean value
must be taken for a large number of observations. This must detract
from the accuracy of the measurements, and for this reason, because of
the approximate nature of the derivation of equations (37) and (38)
and because of uncertainties in the value of $\gamma$, it is surprising that the
results for silver should agree so closely with those measured by conventional
radioactive methods. Thus KUCZYNSKI finds for silver, 97 using the
wire and cylinder method, which is reported as being the most convenient
and accurate, $D = 0.9 \exp(-45,700/RT)$ compared with JOHNSON'S
results, 31 recently well confirmed by HOFFMAN and TURNBULL, 29 of
$D = 0.89 \exp(-45,900/RT)$.

The results for copper are in rather less good agreement but still 
fairly close to those obtained by conventional radioactive methods. 

KUCZYNSKI 95 considered that the transfer of matter was definitely by
volume diffusion, for his examination of the other possible processes of
viscous or plastic flow, of evaporation and condensation and of surface
diffusion suggested that these require the relation between $x$ and $t$ to
be, respectively, $x^2 \sim t$, $x^3 \sim t$, and $x^7 \sim t$, whereas the experiments
clearly produced a fifth power law consistent only with the supposition
of volume diffusion. However, both CABRERA 99 and SCHWED 100 have
criticized KUCZYNSKI'S treatment of surface diffusion and pointed out
that an $x^5 \sim t$ law should hold for both surface and volume diffusion:
SCHWED further suggests that for very small particles the law should
become $x^3 \sim t$. The experiments on silver and copper seem though to
support the view that sintering is in fact by volume diffusion and not
surface diffusion, at least for the particle and wire sizes used, particularly
because the activation energies are very close for silver and for copper to
those found for volume diffusion by more direct methods.
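The exponent $n$ in a law $x^n \sim t$ is conveniently obtained as the slope of $\log t$ against $\log x$. The sketch below fits that slope by least squares and compares it with the exponents KUCZYNSKI assigned to the various processes; the data are synthetic (generated from an exact fifth power law) and the function and dictionary names are introduced here for illustration. In view of the criticism by CABRERA and SCHWED, a fitted exponent of 5 should not by itself be taken to exclude surface diffusion.

```python
import math

def fit_power_law_exponent(times, widths):
    """Least-squares slope of log t against log x, i.e. n in x^n ~ t."""
    lx = [math.log(x) for x in widths]
    lt = [math.log(t) for t in times]
    mx, mt = sum(lx) / len(lx), sum(lt) / len(lt)
    sxx = sum((u - mx) ** 2 for u in lx)
    sxt = sum((u - mx) * (v - mt) for u, v in zip(lx, lt))
    return sxt / sxx

# Exponents as assigned by KUCZYNSKI in the text.
KUCZYNSKI_EXPONENTS = {
    2: "viscous or plastic flow",
    3: "evaporation and condensation",
    5: "volume diffusion",
    7: "surface diffusion",
}

# Synthetic neck half-widths obeying x^5 ~ t (not experimental data).
times = [600.0, 1800.0, 3600.0, 7200.0, 14400.0]   # sec
widths = [1.0e-4 * t ** 0.2 for t in times]        # cm

n_fit = fit_power_law_exponent(times, widths)
closest = min(KUCZYNSKI_EXPONENTS, key=lambda n: abs(n - n_fit))
print(n_fit, KUCZYNSKI_EXPONENTS[closest])
```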

The method is one of promise and seems capable of providing quite 
accurate values for D, although it has not yet been fully developed 
experimentally : another way of measuring the width of the bond which 
has been suggested 101 is to follow the change in contact resistance between 
two metallic spheres or hemispheres as these sinter together. It would 
benefit from a more thorough theoretical treatment of the sintering 
process. It has the great advantage over conventional methods in that 
it could be used to measure the self-diffusion coefficients of metals for 
which no suitable radioactive tracers exist, such as aluminium and 
magnesium. 

2. Anelastic Methods 

In both interstitial solid solutions in b.c.c. metals 102 and substitutional 
solid solutions in f.c.c. metals 103 there is a type of anelasticity which 
has been interpreted as arising from a redistribution of solute atoms in 
the presence of an applied stress, and which is, therefore, intimately 
related to the diffusion properties of the solute atoms. Measurements on
this type of anelasticity have recently been employed to measure the
diffusion coefficients of solute atoms in both these types of solid solution.

A solute atom in an octahedral interstitial position will produce a 
distortion in the surrounding lattice with tetragonal symmetry, the 
tetragonal axis being along one of the three $\langle 100 \rangle$ directions. 102 In
the absence of any applied stress the tetragonal axes of distortion will
be distributed equally among these three $\langle 100 \rangle$ directions, but if a
tensile stress is applied along one of these directions atoms will tend to 
move into those positions for which the tetragonal axis is along the same 
direction, for the stress is then to some extent relieved. A certain time 
will be required for the establishment of the new stress induced preferen- 
tial distribution of atoms so that the strain will lag behind the stress. 
This hysteresis will give rise to relaxation and other anelastic effects 
such as internal friction. 

In substitutional solid solution isolated solute atoms produce a local 
distortion with cubic symmetry so that no redistribution occurs on 
applying a stress. But in a random distribution of solute atoms there 
will occur pairs of solute atoms which are nearest neighbours to one 
another and the local distortion produced around such pairs will be of 
tetragonal symmetry. On applying a stress the pair axes will tend to 
orient themselves along those crystallographically permissible directions 
which make the least angle with the direction of stress, for by so doing 
the stress is to some extent relieved and the strain energy reduced. 
A pair reorientation is achieved by the migration of either member of 
the pair to a mutual nearest neighbour position, and it is to such a 
process that these particular internal friction effects, observed only in 
alloys, have been attributed. 

Clearly, one would expect the relaxation strength, or the magnitude
of the internal friction, to be greater, for equal solute concentration,
for interstitial solid solution in b.c.c. lattices than in f.c.c. substitutional
solutions, for in the latter case it is only the randomly occurring pairs
of atoms which contribute to the anelastic effects. Thus, these may be
observed with very low concentration of, for example, C in $\alpha$-Fe
(0.01 per cent) whereas a concentration greater than ~ 10 per cent is
necessary with substitutional alloys. The larger the difference in size
between solute and solvent atoms the greater the relaxation strength
and so the lower the concentration at which observable effects are
obtained. But at these high concentrations the concept of the reorientation
of pairs requires considerable modification, for much larger
groupings of solute atoms will occur and isolated pairs will be relatively
uncommon. However, calculations based on the simple pair concept do
seem to provide, from experimental measurements, values of $D$ which,
when extrapolated to the higher temperatures at which conventional
diffusion coefficient measurements are made, agree fairly well with the
values the latter give. This, and other evidence, confirms that the
internal friction effects are intimately associated with the diffusion
properties of the alloy, but a more careful theoretical treatment would
be welcome. The anelastic effects in the b.c.c. interstitial solutions are
more clear in origin and free from such difficulties of theoretical
interpretation.

These anelastic effects 104 are characterized by the "relaxation time" 
$\tau_R$ required for the establishment of the new stress-induced equilibrium 
redistribution of atoms or pairs of atoms, and $\tau_R$ is clearly closely related 
to $\tau$, the mean time of stay of any one atom in one lattice position. The 
diffusion coefficient D for the solute atoms is given by equation (13), so 
that a determination of $\tau_R$ serves to determine D. 

For interstitial solid solution 126, 57 in b.c.c. lattices $\tau = 3\tau_R/2$ and 
for substitutional solid solution in f.c.c. lattices it is sufficiently accurate 
to take $\tau = \tau_R$ for polycrystalline material, although for single crystals 
the relation between $\tau$ and $\tau_R$ varies with the direction of the stress. 105 

$\tau_R$ may be determined by measurements either of the internal friction 
or of the elastic after-effect. 104 

The first method consists in measuring the temperature at which the 
maximum occurs in a plot of internal friction as a function of temperature. 
If $\omega$ is the vibrational frequency at which the experiments are 
being conducted then $\tau_R$ at this temperature $= 1/\omega$. Measurements 
of $\tau_R$ at other temperatures are made by varying the frequency $\omega$. 
With the specimen as a wire forming the suspension of a torsional 
pendulum and with photographic recording of the vibrations, measurements 
may be made in the range ~ 1 to 100 c.p.s. For higher frequencies 
and, therefore, also temperatures, it is more convenient to study the 
damping of a bar of material set into flexural oscillations, using 
electronic methods of timing. 
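
The chain from a measured peak to a diffusion coefficient can be sketched numerically. The following is a minimal illustration only, for interstitial solutes in a b.c.c. lattice. It assumes that $\omega$ is an angular frequency, and that D is related to the mean time of stay by $D = a^2/(24\tau)$, a standard random-walk result for octahedral interstitial sites which stands in here for equation (13), not reproduced in this section. The function names and the numerical values (a pendulum of 1 c.p.s., a lattice parameter of 2.866 Å) are hypothetical.

```python
import math


def relaxation_time_from_peak(omega):
    """tau_R at the temperature of the internal friction maximum: tau_R = 1/omega."""
    return 1.0 / omega


def mean_time_of_stay_bcc_interstitial(tau_r):
    """tau = 3*tau_R/2, the relation quoted for interstitial solutes in b.c.c. lattices."""
    return 1.5 * tau_r


def diffusion_coefficient_bcc_interstitial(tau_r, lattice_parameter):
    """ASSUMPTION: D = a^2 / (24 tau), equivalent to a^2 / (36 tau_R).

    This is a standard random-walk relation used here in place of
    equation (13), which is not reproduced in this section.
    """
    tau = mean_time_of_stay_bcc_interstitial(tau_r)
    return lattice_parameter ** 2 / (24.0 * tau)


# Illustrative values only (not taken from the text).
omega = 2.0 * math.pi * 1.0      # angular frequency for a 1 c.p.s. pendulum, rad/sec
a = 2.866e-8                     # assumed lattice parameter of alpha-Fe, cm
tau_r = relaxation_time_from_peak(omega)
D = diffusion_coefficient_bcc_interstitial(tau_r, a)
print(f"tau_R = {tau_r:.4f} sec, D = {D:.3e} cm^2/sec")
```

Repeating the calculation for the peak temperature found at each frequency gives one value of D per temperature.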

In the second method, applicable at lower temperatures and for 
values of $\tau_R$ greater than a minute or so, the specimen in wire form is 
twisted and kept so for a time of the order of $\tau_R$. On releasing the 
stress the elastic strain is recovered immediately, but the anelastic 
strain only gradually, and a measure of this strain as a function of time 
determines $\tau_R$. 

Using both methods values of $\tau_R$ from about 1 hr to, say, $10^{-4}$ sec 
(for $\omega \sim 10^{4}$ c.p.s.) may be measured, covering 8 cycles of 10 in the 
value of D. The corresponding temperatures are such as to extend the 
temperature range over which diffusion coefficients may be measured 
from the comparatively restricted range covered by the conventional 
methods right down to below room temperature for C in α-Fe, for 
example, 57 and to within about 200°C of room temperature for alloys 
such as CuZn 106 and AgZn. 107 

This very considerable advantage of the method means that very 
accurate values of D and Q may be obtained, which, as mentioned on 
p. 313, give positive values for the entropy of activation. Kirkendall 
or concentration gradient effects are without influence on the results, for 
measurements are made, of course, on chemically homogeneous material, 
and, therefore, also provide values of D at well-defined compositions. 
Grain boundary diffusion plays no part in determining $\tau_R$ but, at least 
for the substitutional alloys, single crystal or quite coarsely polycrystalline 
material must be used to avoid or reduce anelastic effects 
associated with grain boundaries such as grain boundary slip, 108 which 
might otherwise contribute more appreciably to the observed anelastic 
effects than the process being studied. 

For substitutional solid solutions the method has certain limitations. 
The solute and solvent atoms must differ appreciably in size and the 
concentration must not be so low that the number of solute atom pairs is 
inappreciable. 

These methods have been applied most extensively to interstitial 
solid solutions, in particular of C 126, 102, 56, 57 and N 102, 56, 58 in α-Fe and 
of O2, 110 N2, 111 and C 110 in Ta, and have provided 56-58 the accurate values 
of $D_0$ and Q made use of in the work discussed in Section III. Fig. 12 
shows the results for C in α-Fe, largely by WERT, 57 together with those of 
STANLEY 109 measured by the conventional methods. The extension of 
the measurements to lower temperatures clearly provides for an 
accuracy in D and Q quite unattainable by the conventional methods 
alone. 

Comparatively little use has yet been made of the method with 
substitutional solid solutions. NOWICK 107 has studied the diffusion of 
Zn (15 per cent) in Ag and LE CLAIRE and CHILDS 106 of Zn (10-30 per 
cent) in Cu, and in each case the results give a very accurately linear 
relation between log D and 1/T, which when extrapolated to high 
temperatures gives values of D agreeing fairly well with those obtained 
by conventional methods. Earlier measurements 108, 122 of the internal 
friction in 30 per cent Zn α-brass have since been made use of 106 to give 
values for D. An interesting result of the CuZn experiments is that 
between 15 and 30 atomic per cent Zn, the activation energy is fairly 
constant at 38,000 cal/mole, contrary to previous measurements 
which show a marked variation. D increases with composition but not 
so rapidly as indicated by RHINES and MEHL. 123 
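
The linear relation between log D and 1/T is what allows $D_0$ and Q to be extracted and the results extrapolated to higher temperatures. A minimal least-squares sketch follows, assuming the usual Arrhenius form $D = D_0 \exp(-Q/RT)$. The data are synthetic, generated from assumed values of $D_0$ and Q, and are not measurements from any of the papers cited; the function name is hypothetical.

```python
import math

R = 1.987  # gas constant, cal/(mole deg K)


def fit_arrhenius(temps_k, d_values):
    """Least-squares straight line through (1/T, ln D); returns (D0, Q in cal/mole)."""
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(d) for d in d_values]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals -Q/R
    intercept = y_mean - slope * x_mean  # equals ln D0
    return math.exp(intercept), -slope * R


# Synthetic data from assumed parameters (illustrative only).
d0_true, q_true = 0.02, 20000.0
temps = [250.0, 300.0, 400.0, 600.0, 900.0]
ds = [d0_true * math.exp(-q_true / (R * t)) for t in temps]

d0_fit, q_fit = fit_arrhenius(temps, ds)
print(f"D0 = {d0_fit:.4g} cm^2/sec, Q = {q_fit:.1f} cal/mole")
```

The wider the temperature range spanned by the points, the smaller the uncertainty in the fitted slope, which is the practical gain from combining the two kinds of measurement.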

An interesting use of this method stemming from the advantages of 
the short times in which and the low temperatures at which diffusion 
coefficients may be measured by it, has already been described (p. 303). 

These advantages arise, of course, because the internal friction 
methods of measurement only require the diffusion of atoms over 
distances of the order of the atomic spacing, whereas the conventional 
methods require diffusion distances of ~ I/mm or so. In order to ob- 
tain the latter in a reasonable time, experiments have to be conducted 
at high temperatures where the jump frequency is large. 

The internal friction methods are therefore of importance in any 
study of the effect on diffusion rates of any condition of the material 
which is appreciably influenced by the temperatures at which conven- 
tional measurements are made, in a time short compared with that 
required for their execution. Such conditions are the presence of cold 
work and of radiation damage as well as of a non-equilibrium vacancy 
concentration. 

SECTION V 

THERMODYNAMIC AND KINETIC THEORY OF CHEMICAL DIFFUSION 

Darken's Equations 

DARKEN, 8 it will be remembered, in order to provide a phenomenological 
description of binary diffusion which would embrace the Kirkendall 
effect, proposed that a separate and independent diffusion coefficient 
should be ascribed to each of the two diffusing species defining their 
respective rates of flow relative to a system of markers fixed in the 
system. It is usual to assume that the markers define lattice planes so 
if we generalize DARKEN'S proposal, the rates of flow $J_i$ relative to the 
lattice of the several species of an n component system are supposed 
given by 

$$J_i = -D_i \, \partial c_i / \partial x$$    .... (39) 

In the case of a binary alloy these equations lead, subject to the 
previously discussed assumptions, to equations (1) and (2) of Section I. 

Now equation (39) implies that the necessary condition for diffusion 
to occur between two points is that there exists a concentration 
gradient between them and assumes the simplest mathematical expression 
of this fact by putting the rate of flow proportional to the concentration 
gradient. The constants of proportionality $D_i$ then characterize 
the diffusion behaviour of the system. Although this is a reasonable 
deduction from experience, it is thermodynamically more sound to 
expect that the essential condition for diffusion of any one species to 
occur between two points is that there should exist between them a 
difference in the chemical potential of that species, for a system is only 
in equilibrium when the chemical potential of each species present is the 
same throughout the system. The simplest mathematical expression 
of this fact is to assume the rate of flow of any one species to be proportional 
to the gradient of its chemical potential $\mu_i$. The rates of flow of 
the several species are then to be given by the equation 

$$J_i = L_i \, \partial \mu_i / \partial x$$    .... (40) 



The diffusion behaviour of each species may now be specified by the 
corresponding coefficient $L_i$ but it is more common practice to retain 
equation (39) as a definition of the diffusion coefficients and to express 
each $D_i$ in terms of the quantities appearing in equation (40). In this 
way we find that (see PMP.1) 

$$D_i = -\frac{L_i}{c_i}\,\frac{\partial \mu_i}{\partial \log c_i} = -RT\,\frac{L_i}{c_i}\left(1 + \frac{\partial \log \gamma_i}{\partial \log c_i}\right)$$    .... (41) 

where use has been made of the usual expression, $\mu_i = \mathrm{const} + RT \log c_i \gamma_i$, 
for $\mu_i$ in terms of the activity coefficient $\gamma_i$. The term $L_i/c_i$ 
corresponds to the "mobility" $G_i$ in terms of which the phenomenological 
theory of diffusion was discussed in PMP.1. 

These phenomenological equations (40) and (41) are frequently used 
in preference to equation (39) in dealing with diffusion, particularly in 
liquids, and although their use in describing metallic diffusion processes 
has been suggested from time to time in the literature, little application 
appears to have been made of them until the work of BIRCHENALL and 
MEHL 112 and of DARKEN. 8 

The first important application of these equations, by DARKEN, was 
to provide a relation between the individual chemical diffusion coefficients 
$D_i$ and the corresponding self-diffusion coefficients $D_i^*$. For this 
purpose DARKEN assumes that the mobility $L_i/c_i$, defining the average 
velocity of flow by diffusion under unit free energy gradient (see 
PMP.1), is the same, for any one chemical species, under conditions of 
both self-diffusion and chemical diffusion, so that differences between 
$D_i$ and $D_i^*$ arise entirely from differences in the chemical potential 
gradient in the two cases. In self-diffusion the chemical potential 
gradient involves only the entropy of mixing so that 

$$D_i^{*} = -RT\,L_i/c_i$$    .... (42) 

With the above expression for $D_i$, we then find 

$$D_i = D_i^{*}\,(1 + \partial \log \gamma_i / \partial \log c_i)$$    .... (43) 

relating $D_i$ and $D_i^*$. 

For the special case of a binary diffusing system we may now relate 
$D_a^*$ and $D_b^*$ with the chemical interdiffusion coefficient D and the 
rate of marker movement v (equations (1) and (2)) by making use 
of the Gibbs-Duhem relation for a binary system, 

$$\partial \log \gamma_a / \partial \log c_a = \partial \log \gamma_b / \partial \log c_b$$

Thus 

$$D = (N_a D_b^{*} + N_b D_a^{*})\,(1 + \partial \log \gamma_a / \partial \log c_a)$$    .... (44) 

and 

$$v = (D_a^{*} - D_b^{*})\,(1 + \partial \log \gamma_a / \partial \log c_a)\,\partial c_a / \partial x$$    .... (45) 
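
Equations (44) and (45) are simple enough to evaluate directly. The sketch below merely transcribes them; the function names are hypothetical and the numerical inputs are arbitrary illustrative values, not JOHNSON'S data. The concentration gradient is taken in whatever units equation (45) requires for v to come out as a velocity.

```python
import math


def thermodynamic_factor(dlog_gamma_dlog_c):
    """The factor (1 + d log(gamma_a) / d log(c_a)) common to (43), (44) and (45)."""
    return 1.0 + dlog_gamma_dlog_c


def interdiffusion_coefficient(n_a, d_a_star, d_b_star, dlog_gamma_dlog_c):
    """Equation (44): D = (N_a D_b* + N_b D_a*) x thermodynamic factor."""
    n_b = 1.0 - n_a
    return (n_a * d_b_star + n_b * d_a_star) * thermodynamic_factor(dlog_gamma_dlog_c)


def marker_velocity(d_a_star, d_b_star, dlog_gamma_dlog_c, dc_a_dx):
    """Equation (45): v = (D_a* - D_b*) x thermodynamic factor x dc_a/dx."""
    return (d_a_star - d_b_star) * thermodynamic_factor(dlog_gamma_dlog_c) * dc_a_dx


# Arbitrary illustrative inputs (assumptions, not measured values).
D = interdiffusion_coefficient(n_a=0.5, d_a_star=2.0e-10, d_b_star=4.0e-10,
                               dlog_gamma_dlog_c=0.5)
v = marker_velocity(d_a_star=2.0e-10, d_b_star=4.0e-10,
                    dlog_gamma_dlog_c=0.5, dc_a_dx=10.0)
print(f"D = {D:.3e}, v = {v:.3e}")
```

The sign of v follows the sign of $D_a^* - D_b^*$, so the markers move one way or the other according to which species is the faster.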

In support of equation (44) DARKEN quoted the earlier results of 
JOHNSON, 13 who measured the self-diffusion coefficients $D_{Au}^*$ and $D_{Ag}^*$ 
of gold atoms and silver atoms respectively in a chemically homogeneous 
alloy of 50 per cent composition, and also the chemical interdiffusion 
coefficient D at the same composition. For this latter purpose a 
diffusion couple was used the compositions of the two halves of which 
differed only slightly from their mean of about 50 per cent. DARKEN 
showed that JOHNSON'S values of $D_a^*$, $D_b^*$ and D were satisfactorily 
related by equation (44) with $N_a = N_b = \tfrac{1}{2}$ and with the already known 
thermodynamic data on Au/Ag alloys being used to calculate the last 
factor. Equations (39), (40), (41) and (43) as well as (1) and (2) are 
generally referred to in the metallurgical literature as Darken's equa- 
tions, although most of them appear to have been proposed indepen- 
dently by workers in other fields. 114 

More recently LE CLAIRE and BARNES 19 have pointed out that the 
marker shift actually observed in Au/Ag alloys by CORREA DA SILVA 
and MEHL S agrees quite well with that calculated from equation (45), 
using JOHNSON'S measurements of $D_a^*$ and $D_b^*$. This calculation 
required a rough estimate of the value of $\partial c_a/\partial x$ in CORREA DA SILVA 
and MEHL'S experiments which was not reported by them. SEITH et 
al. 116, 128 actually measured the marker shift in a Au/Ag couple identical 
with the one used by JOHNSON and used his reported data to give the 
exact value of $\partial c_a/\partial x$. Their observed shift of $2.90 \times 10^{-3}$ cm agrees 
fairly well with the value of $3.10 \times 10^{-3}$ cm calculated from equation (45). 

These measurements on Au/Ag alloys do then appear to provide a 
fairly satisfactory check of DARKEN'S equations but more extensive 
experimental work of a similar nature would be very welcome and, 
indeed, necessary to establish the equations firmly. A case might be 
made that any discrepancy between experiment and calculation based 
on Darken's equations was to be attributed to uncertainties in the 
measured value of the chemical diffusion coefficients arising from 
porosity and lateral changes in the dimensions of diffusion samples as 
discussed in Section I. Thus SEITH and KOTTMANN 128 point out that the 
observed value of v in their Au/Ag samples is less than the calculated 
value, as is to be expected if the discrepancy arises from these causes. 

The relation between $D_i^*$ and $D_i$ is important for many reasons. It 
provides the essential link between the theoretically most simple and 
fundamental quantity underlying all diffusion phenomena, namely the 
average number of random jumps made per second by an atom in a 
homogeneous alloy, derived from $D_i^*$, and the practically significant 
rates of chemical diffusion given by $D_i$. The form of the relations will, 
in general, vary with the mechanism of diffusion and so a choice in 
principle may be made between proposed mechanisms by experimental 
study of the corresponding relations (see PMP.1). 

A number of recent papers 15, 59, 20, 120, 121 have been concerned with 
the validity of DARKEN'S equations and of the assumptions upon which 
they are based. One important feature omitted from DARKEN'S treatment 
of diffusion is the part played by the vacancies, when diffusion 
occurs by this process. There is a definite nett flux of vacancies as the 
Kirkendall experiments show and it is natural to inquire to what extent 
this vacancy flow affects the validity of equations (39) and (40). It is by 
no means apparent why, as equation (40) states, the rate of flow of 
atoms of one species should be proportional only to the chemical 
potential gradient of atoms of that species when there is a nett flow of 
the vacancies whose movement alone can lead to a flow of the atoms, or 
to put it another way, when there is such an intimate coupling between 
the flow of atoms and of vacancies. 

A study of the problem is conveniently made in terms of ONSAGER'S 
general theory of irreversible phenomena. 116-118 

General Thermodynamic Theory of Diffusion 

The essential assumption is that the rate of flow $J_i$ of each of the 
n species present in a multicomponent system is linearly 
dependent upon the chemical potential gradient of each of the species 
present, including the vacancies, so that we have as the "thermodynamic 
equations of motion" 

$$J_i = \sum_{j=1}^{n} L_{ij} X_j$$    .... (46) 

where for short we write $X_j$ as the chemical potential gradient of 
the jth species, $X_j = \partial \mu_j / \partial x$. 

The terms $L_{ii}$ correspond closely to the terms $L_i$ in equation (40) 
and, therefore, to the diffusion coefficients but the "cross-terms" 
$L_{ij}$ ($i \neq j$) take account of possible couplings between the several 
diffusion flows. It may be shown 116 that as a consequence of the principle 
of microscopic reversibility, a symmetry exists between these cross 
terms in equation (46) expressed by the "Onsager Relations" 

$$L_{ij} = L_{ji}$$    .... (47) 

(the same is not in general true when equation (46) is rewritten with 
$\partial c_j/\partial x$ in place of $\partial \mu_j/\partial x$). 

The $L_{ij}$ are functions of composition and will in general vary according 
to the choice of plane with reference to which the $J_i$ are measured. 
Let this plane be such that the nett flow of all species across it is zero. 
Then 

$$\sum_{i=1}^{n} J_i = 0$$    .... (48) 

Since for each atom moving from one lattice plane to the next there 
is a vacancy moving in the other direction, equation (48) is satisfied 
for a plane fixed relative to the lattice, as required for comparison of the 
$J_i$ given by equation (46) with those given by equation (39) or (40). 

In addition to the "Onsager relations," equation (48) leads to a further 
reduction in the number of independent coefficients $L_{ij}$ for it may very 
simply be shown 117 to lead to the result 

$$\sum_{j=1}^{n} L_{ij} = 0$$    .... (49) 

$$\sum_{i=1}^{n} L_{ij} = 0$$    .... (50) 

Let us now regard the vacancies as the nth component and put $n \equiv v$. 
With the aid of equation (49) we may now rewrite the equation (46) 
for $J_i$ as 

$$J_i = \sum_{j=1}^{n-1} L_{ij} X_j + L_{iv} X_v = \sum_{j=1}^{n-1} L_{ij}\,(X_j - X_v)$$    .... (51) 

where $X_j$ and $X_v$ are the chemical potential gradients of the jth 
component and of the vacancies respectively. 
The equations (51) give in the most convenient general form the rates 
of flow by diffusion of the vacancies and of the n − 1 atomic species. 
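
The bookkeeping behind equations (46) to (51) can be checked numerically. The sketch below builds an arbitrary symmetric matrix of coefficients whose rows sum to zero and confirms that the flows then sum to zero and that the full form of (46) agrees with the reduced form of (51). The matrix entries, the gradients and the function names are all hypothetical; the last component is taken to be the vacancies.

```python
def build_coefficients(n, off_diagonal):
    """Symmetric n x n matrix (L_ij = L_ji) whose diagonal makes every row sum to zero."""
    L = [[0.0] * n for _ in range(n)]
    for (i, j), value in off_diagonal.items():
        L[i][j] = value
        L[j][i] = value  # Onsager symmetry
    for i in range(n):
        L[i][i] = -sum(L[i][j] for j in range(n) if j != i)  # zero row sum
    return L


def flows_full(L, X):
    """J_i = sum over all n species of L_ij X_j, as in equation (46)."""
    n = len(L)
    return [sum(L[i][j] * X[j] for j in range(n)) for i in range(n)]


def flows_reduced(L, X):
    """J_i = sum over the n-1 atomic species of L_ij (X_j - X_v), as in equation (51)."""
    v = len(L) - 1  # index of the vacancies, taken as the last component
    return [sum(L[i][j] * (X[j] - X[v]) for j in range(v)) for i in range(len(L))]


# Arbitrary illustrative coefficients and gradients (assumptions).
L = build_coefficients(3, {(0, 1): 0.3, (0, 2): -0.1, (1, 2): 0.25})
X = [1.5, -0.7, 0.2]
J_full = flows_full(L, X)
J_reduced = flows_reduced(L, X)
total_flow = sum(J_full)
print(J_full, J_reduced, total_flow)
```

Because the matrix is symmetric, zero row sums imply zero column sums, which is why the total flow vanishes for any choice of gradients.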

If the density of vacancies at each point in the system is such as 
corresponds to them being in equilibrium with the lattice, then the 
free energy of the vacancies is a minimum for small changes in their 
number, so that $\mu_v$ and also $X_v = 0$. Equation (51) now becomes 

$$J_i = \sum_{j=1}^{n-1} L_{ij} X_j$$    .... (52) 

If further we neglect all cross terms $L_{ij}$ we arrive at 

$$J_i = L_{ii} X_i$$    .... (53) 

Neglecting cross terms but not assuming the vacancies are in equilibrium 
we find 

$$J_i = L_{ii}\,(X_i - X_v)$$    .... (54) 

Equation (53) proposes that each $J_i$ is determined solely through the 
corresponding $X_i$. Such phenomenological equations are identical with 
equation (40) on page 321. We now see that their validity rests on the 
assumption that the vacancies are everywhere maintained in thermo- 
dynamic equilibrium with the lattice and that the cross terms, which 
give a contribution to the flow of each component proportional to the 
chemical potential gradient of each of the other components, are 
negligible. 

It may well be in many cases that the vacancies are everywhere in 
thermal equilibrium so that at least equation (52) is valid. This will 
certainly be true in self-diffusion experiments, where chemical homogeneity 
is maintained, and also in chemical diffusion experiments if there 
are sources or sinks for vacancies, such as dislocations, adequate to 
maintain equilibrium. 

As regards the cross terms, at least those representing coupling 
between the flow of vacancies and the flows of the several atomic species 
are not to be neglected, as is to be expected from the very nature of 
vacancy diffusion. The nett flow of vacancies is certainly not zero as the 
Kirkendall effect shows and being given by 

$$J_v = \sum_{j=1}^{n-1} L_{vj}\,(X_j - X_v)$$

may be regarded as determined entirely by the cross terms $L_{vj}$ ($j \le n - 1$). 
The fuller and more general phenomenological theory expressed by 
equation (46) is then necessary because of the observed nett vacancy 
flow. 

But no convincing demonstration has yet been given of the part 
played in vacancy diffusion by the cross terms between different 
atomic species, and in particular of whether they are zero or sufficiently 
small to be neglected. Obviously, for an exchange or ring diffusion 
mechanism, the cross terms would certainly be significant. 

It is to be noted that a simple diffusion coefficient $D_i$ as given by 
equation (39) can only in general be defined for each species when cross 
terms are neglected and equation (53) is valid. With non-zero cross 
terms then it is not possible in an n component system to obtain any 
equation corresponding to equation (43). 

However, in the simple but important special case of binary diffusion 
the rate of flow of each species may, when the vacancies are in equili- 
brium, be written as proportional only to its own concentration gradient 
even when cross terms are not ignored. We may, therefore, obtain 
diffusion coefficients for the two species, so that the use of DARKEN'S 
equations (1) and (2) is consistent with this more general theory. But 
DARKEN'S relation (43) and, therefore, (44) and (45) only follow when 
cross terms are zero. These results are simply shown as follows. 

In a measurement of the self-diffusion coefficient of one component 
of a binary alloy we are dealing with a system of four identifiable 
species, inactive A atoms, active A* atoms, inactive B atoms, and 
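vacancies. Denoting these four species respectively by suffixes 1, 2, 3, 4 
the "thermodynamic equations of motion" are 121 

$$\begin{aligned}
J_1 &= L_{11}X_1 + L_{12}X_2 + L_{13}X_3 + L_{14}X_4 \\
J_2 &= L_{21}X_1 + L_{22}X_2 + L_{23}X_3 + L_{24}X_4 \\
J_3 &= L_{31}X_1 + L_{32}X_2 + L_{33}X_3 + L_{34}X_4 \\
J_4 &= L_{41}X_1 + L_{42}X_2 + L_{43}X_3 + L_{44}X_4
\end{aligned}$$    .... (55) 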

A self-diffusion coefficient measurement requires a determination of 
$J_2$ in a chemically homogeneous alloy of A and B, i.e. in which $X_3 = 0$ 
and $c_1 + c_2 = c_a$, the uniform and constant total concentration of A 
atoms. If we assume the vacancies are in equilibrium, $X_4 = 0$ and 

$$J_2 = L_{21}X_1 + L_{22}X_2$$    .... (56) 

The Gibbs-Duhem relation in the form $\sum_i c_i X_i = 0$ enables us to 
eliminate $X_1$ from (56) and so obtain $J_2$ in the required form, from which 
we find for the self-diffusion coefficient of A atoms in the alloy $D_a^*$, 

$$D_a^{*} = -RT\,(L_{22}/c_2 - L_{21}/c_1)$$    .... (57) 

The further conditions that $J_1 + J_2 = 0$ and $J_4 = 0$ lead to the relations 

$$c_2 L_{31} = c_1 L_{32}$$    .... (58) 

and 

$$c_2\,(L_{11} + L_{21}) - c_1\,(L_{22} + L_{12}) = 0$$    .... (59) 

Use of (59) enables us to write $D_a^*$ in the more symmetric form 121 

$$D_a^{*} = -RT\left\{L_{11} + L_{22} - \left(\frac{c_1}{c_2} + \frac{c_2}{c_1}\right)L_{12}\right\}\Big/ c_a$$    .... (60) 
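
That the symmetric form (60) follows from the constraint (59) can be confirmed with exact rational arithmetic. In the sketch below the unsymmetric form is taken to be $D_a^* = -RT\,(L_{22}/c_2 - L_{12}/c_1)$, which is what elimination of $X_1$ from (56) gives when $L_{21} = L_{12}$ and $X_2 = (RT/c_2)\,\partial c_2/\partial x$; this is a reconstruction under those assumptions. All numerical values and function names are hypothetical, and RT is set to unity.

```python
from fractions import Fraction as F


def l11_from_constraint(c1, c2, l12, l22):
    """Solve c2 (L11 + L21) - c1 (L22 + L12) = 0 for L11, with L21 = L12."""
    return c1 * (l22 + l12) / c2 - l12


def d_star_unsymmetric(rt, c1, c2, l12, l22):
    """D_a* = -RT (L22/c2 - L12/c1), from eliminating X1 in equation (56)."""
    return -rt * (l22 / c2 - l12 / c1)


def d_star_symmetric(rt, c1, c2, l11, l12, l22):
    """D_a* = -RT {L11 + L22 - (c1/c2 + c2/c1) L12} / c_a, as in equation (60)."""
    c_a = c1 + c2
    return -rt * (l11 + l22 - (c1 / c2 + c2 / c1) * l12) / c_a


# Arbitrary illustrative values (assumptions), held as exact fractions.
rt = F(1)
c1, c2 = F(3, 10), F(1, 10)
l12, l22 = F(-1, 7), F(-5, 3)
l11 = l11_from_constraint(c1, c2, l12, l22)

d_unsym = d_star_unsymmetric(rt, c1, c2, l12, l22)
d_sym = d_star_symmetric(rt, c1, c2, l11, l12, l22)
print(d_unsym, d_sym)
```

Exact fractions are used so that agreement of the two forms is an identity and not a matter of rounding.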

A measurement of the chemical diffusion coefficient of A requires a 
determination of the total flow $J_1 + J_2$ of A in an alloy in which a 
chemical concentration gradient of A exists. In this case, not the sum of 
$c_1$ and $c_2$ but the ratio $c_1/c_2$ remains constant at every point. Thus, as 
$c_1$ and $c_2$ are concentrations of chemically identical species 

$$c_1 X_1 + c_2 X_2 = c_a X_a$$    .... (61) 

where $X_a$ is the total chemical potential gradient of A atoms, and 

$$X_1 = X_2 = X_a$$

With these and the Gibbs-Duhem relation we find for $D_a$ 

$$D_a = -RT\,\{(L_{11} + L_{22} + 2L_{12})/c_a - (L_{13} + L_{23})/c_3\}\left(1 + \frac{\partial \log \gamma_a}{\partial \log c_a}\right)$$    .... (62) 

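From (60) and (62) the general relation between $D_a$ and $D_a^*$ becomes, 90 
using (58), 

$$D_a = \left\{D_a^{*} - RT\,\frac{c_a}{c_2}\left(\frac{L_{12}}{c_1} - \frac{L_{23}}{c_3}\right)\right\}\left(1 + \frac{\partial \log \gamma_a}{\partial \log c_a}\right)$$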

It is clear that this is equivalent to DARKEN'S relation between $D_a$ 
and $D_a^*$ only when $L_{23}/c_3 = L_{12}/c_1$ or when the cross terms are both 
zero. The first condition would mean that the diffusion behaviour of the 
two species of atoms is identical whereas the Kirkendall effect clearly 
shows that such is not the case. (The analogous equation (58) is a direct 
consequence of the identity of A and A* atoms.) This condition would, 
of course, hold for a mixture of three identical isotopes; since the 
thermodynamic factor is unity in this case we arrive at the trivial 
result $D_a = D_a^*$. 
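
The way in which the cross terms enter can be made explicit by subtracting (60) from (62) with the thermodynamic factor divided out. Using (58) and $L_{ij} = L_{ji}$, the difference reduces to $-RT\,(c_a/c_2)(L_{12}/c_1 - L_{23}/c_3)$, which vanishes under exactly the condition stated above. The sketch below checks this reduction with exact fractions. It is an illustrative reconstruction; the numerical values and function names are hypothetical and RT is set to unity.

```python
from fractions import Fraction as F


def d_a_star(rt, c1, c2, l11, l12, l22):
    """Equation (60): self-diffusion coefficient in the symmetric form."""
    return -rt * (l11 + l22 - (c1 / c2 + c2 / c1) * l12) / (c1 + c2)


def d_a_over_factor(rt, c1, c2, c3, l11, l12, l22, l13, l23):
    """Equation (62) with the thermodynamic factor divided out."""
    c_a = c1 + c2
    return -rt * ((l11 + l22 + 2 * l12) / c_a - (l13 + l23) / c3)


def cross_term_correction(rt, c1, c2, c3, l12, l23):
    """-RT (c_a/c2) (L12/c1 - L23/c3): zero when L23/c3 = L12/c1."""
    return -rt * ((c1 + c2) / c2) * (l12 / c1 - l23 / c3)


# Arbitrary illustrative values (assumptions), chosen to satisfy (58) and (59).
rt = F(1)
c1, c2, c3 = F(3, 10), F(1, 10), F(6, 10)
l12, l22, l23 = F(-1, 7), F(-5, 3), F(-1, 4)
l13 = c1 * l23 / c2                    # equation (58): c2 L31 = c1 L32
l11 = c1 * (l22 + l12) / c2 - l12      # equation (59)

difference = (d_a_over_factor(rt, c1, c2, c3, l11, l12, l22, l13, l23)
              - d_a_star(rt, c1, c2, l11, l12, l22))
correction = cross_term_correction(rt, c1, c2, c3, l12, l23)
print(difference, correction)
```

Setting `l23 = c3 * l12 / c1` in the same script makes both quantities zero and recovers DARKEN'S relation.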

We have seen that there is a little experimental evidence in support 
of equation (43), which therefore suggests that the cross terms may be 
unimportant. But a theoretical justification of this conclusion, indi- 
cating that cross terms are, in fact, zero or at least sufficiently small to 
be neglected, can only come through a detailed study by kinetic methods 
of the process of vacancy diffusion in binary alloys. Although several 
attempts have been made to construct such a theory, the problem is not 
yet sufficiently clearly or convincingly resolved to warrant a full 
review of them. 

It is interesting to note in passing that the cross terms do play an 
important part in the phenomena of diffusion in strong electrolytes. 113 

Kinetic Theories of Diffusion 

BARDEEN 120 has shown how equation (54) may be derived by simple 
kinetic considerations, so that DARKEN'S equation (43) again follows if 
the vacancies are in equilibrium, but the assumptions introduced into 
the analysis have been criticized by SEITZ, 15 who regards them as 
"exceedingly restrictive." BARDEEN and HERRING 20 in a later paper 
claim to have avoided the more important of these assumptions, without 
influence on the results, but their treatment is not without objections. 
SHUTTLEWORTH 121 has considered a model in which one species 
of atom exchanges very much more readily with the vacancies than does 
the other, and finds a relation between $D_a$ and $D_a^*$ which differs from 
equation (43) in that the factor $(1 + \partial \log \gamma_a/\partial \log c_a)$ is replaced by 
$(2 + \partial \log \gamma_a/\partial \log c_a)$ so that cross terms are clearly of importance in 
such a case. 

It would hardly be profitable in the present incomplete state of the 
subject, to describe all these various attempts at the problem in detail, 
but SEITZ'S 15, 59 more general approach may be mentioned for it reveals 
in a fairly simple way and with the fewest assumptions, the requirement 
that the vacancies must be in equilibrium for the rates of flow of the 
two species to be expressible by diffusion coefficients. 

SEITZ was the first clearly to point out that the Kirkendall effect was 
consistent with a vacancy mechanism of diffusion if it was assumed that 
vacancies exchanged more readily with one type of atom than with the 
other. He introduces the quantity $\pi_a(\xi, \eta)$ defining the probability that 
a vacancy on a plane where the composition is $\xi$ will exchange in unit 
time with an A atom on a neighbouring plane where the composition is $\eta$. 
A similar quantity $\pi_b(\xi, \eta)$ defines the corresponding probability for a 
vacancy exchange with a B atom. The total probability that in unit 
time a vacancy will jump from plane 1 to plane 2 is clearly $\pi_a + \pi_b$, 
which sum we denote by $\pi$. 

We suppose diffusion to occur between two neighbouring planes 
1 and 2 separated by a distance $\lambda$ and whereon respectively the planar 
densities of A atoms, B atoms, and vacancies are $n_{a1}$, $n_{a2}$; $n_{b1}$, $n_{b2}$; $n_{v1}$, $n_{v2}$. 
If the total number of sites n remains constant during the diffusion and 
since $n_v$ is very small the composition on each plane is completely 
determined when only $n_{a1}$ and $n_{a2}$ are given. 

The nett flow of vacancies from plane 1 to plane 2 is 

$$J_v = n_{v1}\,\pi(n_{a1}, n_{a2}) - n_{v2}\,\pi(n_{a2}, n_{a1})$$

Since $n_{a1}$ and $n_{a2}$ differ only slightly from their mean value of $n_a$, $\pi$ 
may be expanded to the first order in $\lambda$ to give 

$$J_v = -\lambda\,\pi\,\frac{\partial n_v}{\partial x} - \lambda\,n_v\left(\frac{\partial \pi}{\partial \xi} - \frac{\partial \pi}{\partial \eta}\right)\frac{\partial n_a}{\partial x}$$

We thus see that even when there is no concentration gradient of 
vacancies a flux of vacancies exists solely on account of the chemical 
concentration gradient. 
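
The intermediate steps of the first-order expansion may be sketched as follows. This is a reconstruction under stated assumptions and not a transcription: the two planes are taken to lie at $x \mp \lambda/2$, and $\pi_\xi$, $\pi_\eta$ (notation introduced here) denote the partial derivatives of $\pi(\xi, \eta)$ with respect to its first and second arguments, evaluated at $\xi = \eta = n_a$.

```latex
\begin{align*}
n_{v1,2} &= n_v \mp \tfrac{\lambda}{2}\,\frac{\partial n_v}{\partial x}, \qquad
n_{a1,2} = n_a \mp \tfrac{\lambda}{2}\,\frac{\partial n_a}{\partial x}, \\
\pi(n_{a1}, n_{a2}) &\approx \pi - \tfrac{\lambda}{2}\,(\pi_\xi - \pi_\eta)\,\frac{\partial n_a}{\partial x}, \\
\pi(n_{a2}, n_{a1}) &\approx \pi + \tfrac{\lambda}{2}\,(\pi_\xi - \pi_\eta)\,\frac{\partial n_a}{\partial x}, \\
J_v = n_{v1}\,\pi(n_{a1}, n_{a2}) - n_{v2}\,\pi(n_{a2}, n_{a1})
 &\approx -\lambda\,\pi\,\frac{\partial n_v}{\partial x}
   - \lambda\,n_v\,(\pi_\xi - \pi_\eta)\,\frac{\partial n_a}{\partial x}.
\end{align*}
```

The first term is the ordinary flow down a vacancy concentration gradient; the second survives when $\partial n_v/\partial x = 0$ and is the flux arising solely from the chemical concentration gradient.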

The flow of A atoms is clearly 



which we may write as 



If the vacancies are everywhere in thermodynamic equilibrium with 
the lattice, then the vacancy concentration is a function only of chemical 
composition and we may write 

$$\frac{dn_v}{dx} = \frac{dn_v}{dn_a} \cdot \frac{dn_a}{dx}$$

$J_a$ in equation (64) may now be written proportional to the concentration 
gradient of A atoms, so we may obtain the chemical diffusion 
coefficient D as 



where $n_a/\lambda$ has been replaced by $c_a$, the volume concentration of A 
atoms. 

We see then again that equation (39) and, therefore, also (1) and (2) 
are valid for binary diffusion only when the vacancies are maintained 
in equilibrium. 

It is readily shown that $D_a^*$ is given by 

$$D_a^{*} = n_v\,\pi_a(n_a, n_a)\,\lambda^2 / n_a$$    .... (66) 

so that the expressions for $D_a$ and $D_a^*$ are intrinsically different in form, 
for $D_a^*$ involves only $\pi_a$, whereas $D_a$ involves also its first derivatives. 
This represents the least objectionable justification by kinetic 
methods available so far for the use of equations (1) and (2) in describing 
binary diffusion; it is only when simplifying assumptions are made as 
to the form of $\pi$ that a relation between $\pi$ and its derivatives and, 
therefore, between $D_a$ and $D_a^*$ may be obtained. It is, in effect, at 
this stage, in the choice of simplifying assumptions, that the various 
attempts at constructing a kinetic theory of diffusion diverge. And 
none of these assumptions is sufficiently unrestricted to lead to a 
convincingly general result. 

7 
NUCLEATION 

J. H. Hollomon and D. Turnbull 

INTRODUCTION 

IN a large number of chemical reactions there are no recognizable 
centres at which the reaction begins and from which it propagates. In 
contrast to this mode of reaction most phase transformations begin at 
certain sites and propagate from these places. An element of the system 
undergoes no recognizable change until it is swept over by a boundary 
between the transformed and untransformed phases propagating from 
the centre at which the transformation began. 

The initiation of transformation at certain recognizable centres is 
known as nucleation, and the propagation of the transformation from 
the centres is called growth. Even though the nucleation of transforma- 
tions has been recognized and described for at least two centuries, until 
recent years it was difficult to make reproducible and quantitative 
measurements of the rate of nucleation. During the past two decades 
semi-quantitative measurements have been made on the rate of nucleation 
of liquid droplets in supersaturated vapours. These measurements 
largely confirmed the predictions of the theory of homogeneous nuclea- 
tion developed by VOLMER, BECKER and co-workers. More recently the 
rate of nucleation in supercooled liquids has been measured quantita- 
tively. Also our insight into the nature and mechanism of nucleation 
catalyzed by extraneous surfaces has been deepened by recent experi- 
mental and theoretical developments. Even though the understanding 
of nucleation in crystalline media is much less advanced than for 
gaseous and liquid media, an important theoretical groundwork for this 
understanding has been developing. 

These advances in the understanding of nucleation have spurred great 
experimental and theoretical activity in the past few years. Descriptions 
of some of this activity are given in recent reviews by SMOLUCHOWSKI, 1 
TURNBULL and HOLLOMON, IJI HOLLOMON, 2 BRADLEY, 3 and 
LA MER, POUND and REISS. 4 

In this paper we shall critically review the subject of nucleation in 
general but particularly emphasize the rate of nucleation in reactions 
involving crystalline phases. 

Consider the formation of a phase β within a parent phase α by 
the process of nucleation and growth. If the transformation occurs 
isothermally the size of each β region, D, is some function of time 
D = f(t). If no change in composition accompanies the transformation 
β propagates at a constant rate, G, after nucleation and D = G(t − τ), 
where τ, usually a positive intercept on the time axis, is the nucleation 
period. Of course, all regions of β do not nucleate at the same time, so to 
describe the course of the transformation we must know more than one 
value of τ. It is most convenient to define the nucleation frequency, I, 
which is the number of β regions formed per unit of time per unit of 
untransformed volume. I is reciprocally related to a mean value of 
τ. To measure I experimentally we effectively must measure 
D = f(t) for a large number of β regions at a given stage in the transformation, 
extrapolate all these to zero, and determine the number, N, 
of intercepts per unit time. Then I = N/V, where V is the untransformed 
volume corresponding to the particular time. In order for the I 
value to be of significance N must be very large. Where a composition 
change accompanies transformation, as in precipitation, often it is 
found that the growth of β is diffusion limited and $D = G^{1/2}(t - \tau)^{1/2}$. 
For such transformations I is determined similarly. 
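
The extrapolation procedure just described can be sketched for the linear growth law D = G(t − τ). The sizes, growth rate, time window and volume below are invented for illustration, and the function names are hypothetical; a real determination would, as noted, require a very large number of regions.

```python
import math


def nucleation_times(sizes, t_obs, growth_rate):
    """Extrapolate each region to zero size: tau = t_obs - D/G for D = G (t - tau)."""
    return [t_obs - d / growth_rate for d in sizes]


def nucleation_frequency(taus, t_start, t_end, untransformed_volume):
    """I = N/V, with N the number of intercepts per unit time in [t_start, t_end)."""
    count = sum(1 for tau in taus if t_start <= tau < t_end)
    intercepts_per_unit_time = count / (t_end - t_start)
    return intercepts_per_unit_time / untransformed_volume


# Invented illustrative observations (assumptions).
sizes = [15.0, 10.0, 12.5, 2.5]   # sizes D of four beta regions at the time of observation
taus = nucleation_times(sizes, t_obs=100.0, growth_rate=0.5)
I = nucleation_frequency(taus, t_start=70.0, t_end=80.0, untransformed_volume=4.0)
print(taus, I)
```

For diffusion-limited growth the same counting applies once τ is obtained from $D^2 = G(t - \tau)$ instead.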

The kinetics of a large number of isothermal transformations, 
probably the majority, can be satisfactorily described by the two 
parameters $I$ and $G$. Knowing $I$ and $G$ and taking account of the 
geometrical problem of the impingement of growing $\beta$ regions, the 
dependence upon time of the fraction of the $\alpha$ sample transforming to 
$\beta$ may be derived. Derivations of this dependence have been given by 
JOHNSON and MEHL 5 and AVRAMI 6 (now MELVIN). These derivations 
recently have been reviewed critically by BURKE and TURNBULL. 7 
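
The form of the result is not quoted in this review. As a hedged illustration, the sketch below uses the standard textbook expression for constant $I$ and constant $G$ with spherical regions nucleated at random, $X = 1 - \exp[-(\pi/3) I G^3 t^4]$. Two assumptions are made here that are not in the text: $G$ is taken as the radial growth rate, and the nucleation period is folded into $I$. The function name is hypothetical.

```python
import math


def transformed_fraction(t, I, G):
    """Fraction transformed at time t for constant I and radial growth rate G."""
    # Extended volume: integral over birth times t' of I * (4*pi/3) * G^3 * (t - t')^3.
    extended = (math.pi / 3.0) * I * G ** 3 * t ** 4
    # Impingement correction: random overlap of growing regions.
    return 1.0 - math.exp(-extended)


# Sigmoidal course of an illustrative transformation (arbitrary consistent units).
curve = [transformed_fraction(0.25 * k, I=1.0, G=1.0) for k in range(9)]
```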

Examples of phase transformations satisfactorily described as 
nucleation and growth processes are vapour → liquid, vapour → crystal, 
liquid → crystal, precipitation, allotropic transformation, and in a few 
instances disorder → order. A variety of processes that are not phase 
transformations in the ordinary sense can also be described by nuclea- 
tion and growth kinetics. An example is the recrystallization of cold 
worked metals treated in detail in an earlier review. 7 

Nucleation that takes place at random within homogeneous elements 
of volume of a phase is designated homogeneous nucleation. Experience 
indicates that transformation nuclei often form preferentially at various 
interfaces in the system, such as free surfaces, internal surfaces, con- 
tainer walls, suspended crystallites, etc. Such interfaces are said to 
catalyze nucleation and this process of nucleation is called heterogeneous 
nucleation. 

We will describe and explain nucleation in phase transformations 
categorized according to its complexity. In many phase changes, such 
as condensation of a pure vapour or crystallization of a supercooled 
liquid, only a change in structure is involved. The crystallization of a 
phase from a solution involves a change in structure and composition. 
Volume changes accompany most phase transformations; therefore, 
when a phase forms in a crystalline medium strain energy is developed 
due to the transformation. If no composition change is involved, as in 
allotropic transformations in pure crystals, there is structure and strain 
to be considered. The precipitation of a phase from a solid solution is 
the most complex type of transformation in that it is accompanied by 
changes in structure, composition and strain. 

STRUCTURE 
Nucleation of Liquid from Vapour 

General Theory Even though our primary purpose in writing this 
review is to describe and explain nucleation in condensed metal systems, 
it is desirable to re-examine the theory for nucleation of liquids from 
supersaturated vapours and its experimental verification. The theories 
for the nucleation of a liquid from a supersaturated vapour have 
influenced the development of theories for nucleation in condensed 
systems. 

The theory for homogeneous nucleation of liquids in vapours as now 
generally accepted is largely due to VOLMER and WEBER and BECKER 
and DÖRING. 9 † VOLMER and WEBER supposed that dense clusters of 
molecules exist in equilibrium in the vapour and that some of these 
clusters (called embryos) may become nuclei for the growth of liquid 
droplets. The essential assumption of the theory is that the standard 
free energy, $\Delta G$, of these embryos can be calculated as though they 
are liquid droplets having the same liquid-vapour energy per area, 
$\sigma$, and the same free energy per volume, $\Delta G_v$, with reference 
to the vapour phase, as the macroscopic liquid phase. Thus, for spherical 
embryos of radius, $r$, at a given temperature and external pressure, 

$$\Delta G = 4\pi r^2 \sigma + \tfrac{4}{3}\pi r^3\,\Delta G_v, \qquad (1)$$

where $\sigma$ and $\Delta G_v$ are independent of $r$. 
In general, 

$$\Delta G = a\sigma i^{2/3} + b\,\Delta G_v\, i, \qquad (2)$$

where $i$ is the number of molecules per cluster and $a$ and $b$ are 
factors determined by the shape and density of the cluster. 

A second assumption of the theory 8 is that embryos are born, grow 
to nuclei, or disappear by a sequence of bimolecular reactions. 
Designating $\alpha$ as a molecule of vapour and $\beta_i$ an embryo of 
liquid containing $i$ molecules, these sequences of bimolecular processes 
can be represented in the following way: 

$$\begin{aligned}
\alpha + \alpha &\rightleftharpoons \beta_2 \\
\beta_2 + \alpha &\rightleftharpoons \beta_3 \\
&\;\;\vdots \\
\beta_{i-1} + \alpha &\rightleftharpoons \beta_i
\end{aligned} \qquad (3)$$

† Called hereafter the V.W.B.D. theory. 

With these assumptions the absolute frequency of nucleation per 
volume, $I$, of liquid nuclei in supersaturated vapours can be derived. 

Let the number of embryos per volume containing $i$ molecules in 
equilibrium with vapour at a given pressure $P$ and absolute temperature 
$T$ be $n_i$. Then assuming ideal mixing of embryos of all sizes and that 
$n \gg \sum n_i$, where $n$ is the number of single molecules per volume, 
it follows that: 

$$n_i = n \exp[-\Delta G/kT]. \qquad (4)$$

When $\Delta G_v < 0$, $\Delta G = f(r)$ (equation (1)) has a maximum 
(see Fig. 1): 

$$\Delta G^* = 16\pi\sigma^3/3(\Delta G_v)^2, \qquad (5)$$

at 

$$r = r^* = -2\sigma/\Delta G_v. \qquad (6)$$

Embryos having a radius smaller than this critical value will on the 
average disappear, but those larger will grow into liquid droplets. An 
embryo having the critical radius is by definition a nucleus. The number 
of embryos per volume-time becoming nuclei is the nucleation 
frequency,† $I$. 

Fig. 1. Free energy of an embryo of a new phase as a function of its radius 
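
As a check on equations (5) and (6), the following Python sketch evaluates equation (1) at the critical radius and confirms that it reproduces the barrier of equation (5). The numerical values of $\sigma$ and $\Delta G_v$ are illustrative assumptions in CGS units, and the function names are hypothetical.

```python
import math


def embryo_free_energy(r, sigma, dG_v):
    """Equation (1): surface term plus volume term for a spherical embryo."""
    return 4.0 * math.pi * r ** 2 * sigma + (4.0 / 3.0) * math.pi * r ** 3 * dG_v


def critical_radius(sigma, dG_v):
    """Equation (6): r* = -2*sigma/dG_v (dG_v is negative for a supersaturated vapour)."""
    return -2.0 * sigma / dG_v


def nucleation_barrier(sigma, dG_v):
    """Equation (5): dG* = 16*pi*sigma^3 / (3*dG_v^2)."""
    return 16.0 * math.pi * sigma ** 3 / (3.0 * dG_v ** 2)


sigma = 75.0      # erg/cm^2, illustrative value
dG_v = -1.8e9     # erg/cm^3, illustrative value
r_star = critical_radius(sigma, dG_v)
barrier = nucleation_barrier(sigma, dG_v)
```

Evaluating `embryo_free_energy` slightly below and slightly above `r_star` gives smaller values than `barrier`, which is the maximum sketched in Fig. 1.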
$I$ is the net frequency per volume of the bimolecular process: 

$$\beta_{i^*} + \alpha \rightleftharpoons \beta_{i^*+1}, \qquad (7)$$

where $i^*$ is the number of molecules in the nucleus. 

Assuming with VOLMER and WEBER an equilibrium distribution of 
embryos of all sizes, the number of nuclei per volume, $n'_{i^*}$, is 
given by: 

$$n'_{i^*} = n_{i^*} = n \exp[-\Delta G^*/kT]. \qquad (8)$$

Then neglecting the reverse reaction indicated in (7) we obtain the 
expression: 

$$I = z s^* n'_{i^*}, \qquad (9)$$

where $s^*$ is the surface area of the nucleus and $z$ is the collision 
frequency of vapour molecules with the surface of the nucleus per area. 
From kinetic theory $z = P/(2\pi mkT)^{1/2}$ and substituting (8) into (9) 
we have: 

$$I = [P/(2\pi mkT)^{1/2}]\, s^* n \exp[-\Delta G^*/kT]. \qquad (10)$$

BECKER and DÖRING improved this treatment by taking into account 
the reverse process indicated in (7), and the decrease in concentration of 
embryos during a transition from the equilibrium concentration because 
of the rapid growth of nuclei to droplets. 

† This definition does not correspond exactly to the operational 
definition of $I$ given in the introduction. However, on the basis of the 
V.W.B.D. theory the difference in the two $I$ values is negligible in most 
instances. 

Schematic embryo size distribution curves calculated from these two 
points of view are compared in Fig. 2. The expression derived by BECKER 
and DÖRING is: 

$$I = [P/(2\pi mkT)^{1/2}]\, s^* n\, (-b\,\Delta G_v/2\pi i^* kT)^{1/2} \exp[-\Delta G^*/kT]. \qquad (11)$$

Excepting for the factor $(-b\,\Delta G_v/2\pi i^* kT)^{1/2}$, this equation 
for $I$ is identical with equation (10). 

The basic assumption that the free energy of embryos can be calculated 
by assuming them to be small droplets of the macroscopic liquid phase, 
recently has been examined critically by KIRKWOOD and BUFF 10 and 
REISS. 11 The primary criticism derives from the fact that the nucleation 
rate can be perceptible only when the nucleus diameter is of the order of 
10 molecular diameters or less (equation (6)). When the size is this small 
there is some ambiguity in the definition of surface tension. TOLMAN 12 
and KIRKWOOD 13 have developed theories for the variation in the density 
of matter across the liquid-vapour interface which predict that the density 
changes from the liquid value to the vapour value in a transition zone 
(see Fig. 3), of the order of 2-5 molecular diameters in thickness. 

[Fig. 2: ... DÖRING distributions of embryos with respect to size] 

Fig. 3. Variation of density of matter across transition zone 
between liquid droplet and vapour 

As a consequence of this transition zone, these authors show that the 
surface energy $\sigma_r$ of a small droplet of radius $r$ should not be 
identical with $\sigma = \sigma_0$ for plane surfaces but should be related 
to $\sigma_0$ by the equation: 

$$\sigma_r = \sigma_0/(1 + 2\delta/r), \qquad (12)$$

where $\delta$ is the difference between the value of $r$ at the Gibbs 
surface of tension and its value ($r = r_0$) where the superficial density 
of matter vanishes. According to TOLMAN and KIRKWOOD, $\delta$ is of the 
order of one to two molecular diameters; hence, when $\delta/r_0$ is 0.10 to 
0.20, $\sigma_r$ is 0.8 to 0.7 $\sigma_0$. 
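
The figures quoted above can be checked with a short Python sketch. It assumes that the curvature correction has TOLMAN's form, $\sigma_r = \sigma_0/(1 + 2\delta/r)$, which is an assumption adopted here because it reproduces the quoted range of 0.8 to 0.7; the function name is hypothetical.

```python
def curvature_corrected_sigma(sigma_0, delta_over_r):
    """Surface energy of a small droplet, assuming sigma_r = sigma_0 / (1 + 2*delta/r)."""
    return sigma_0 / (1.0 + 2.0 * delta_over_r)


# Ratio sigma_r / sigma_0 at the two limits of delta/r quoted in the text.
ratio_low = curvature_corrected_sigma(1.0, 0.10)
ratio_high = curvature_corrected_sigma(1.0, 0.20)
```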

KIRKWOOD and BUFF 10 and BUFF, 14 though expressing reservations 
about the validity of using macroscopic thermodynamic parameters to 
describe the properties of nuclei, have modified equation (5) by taking 
into account the variation of $\sigma$ with curvature. BUFF'S development 
gives, instead of equation (5), the following expression for $\Delta G^*$: 

- ? / 2 / 3 } .... (13) 

where y = a, + K 2 + 2(7 <5AGJ 1/2 . 

The ambiguity in the definition of the energy of interfaces having 
radii of curvature of only a few molecular diameters is not confined to 
liquid-vapour interfaces but is inherent in the corresponding definitions 
for all types of interfaces. This problem must be reckoned with in the 
treatments of nucleation of crystalline phases in condensed systems. 

The V.W.B.D. theory and its modification tentatively proposed by 
KIRKWOOD and BUFF make substantially different predictions of the 
value of $\Delta G_v$ at which the rate of nucleation becomes perceptible. 
Alternatively, the V.W.B.D. theory predicts that the values of the 
interfacial energy, $\sigma$, calculated from rates of nucleation should be 
in close agreement with that of $\sigma_0$, measured for flat bulk liquid 
surfaces, while the KIRKWOOD-BUFF modification predicts that $\sigma$ 
should be substantially less than $\sigma_0$. 

Transient Nucleation BECKER and DÖRING'S final equation for $I$ was 
derived on the assumption that $I$ is time independent. However, when 
the driving free energy of a phase transition is changed very rapidly 
from one level, $(\Delta G_v)_1$, to another, $(\Delta G_v)_2$, there is a 
transient during which the embryo distribution characteristic of 
$(\Delta G_v)_1$ changes to the distribution characteristic of 
$(\Delta G_v)_2$. During this time the nucleation frequency $I_t$ is time 
dependent. 

The earliest formal treatment of transient nucleation is due to 
ZELDOVICH, 15 who derived an approximate relation $I_t = f(t)$. 
TURNBULL 16 pointed out the importance of transients in solid-solid 
transformations and evaluated $I_t = f(t)$ precisely by a numerical method 
for certain simple cases. More recently KANTROWITZ 17 and PROBSTEIN 18 
have developed the theory for transient nucleation of liquid in 
supersaturated vapours, which is of importance in interpreting the results 
of experiments in which the supersaturation is increased with great 
rapidity. With some simplifying assumptions ZELDOVICH derived from the 
BECKER-DÖRING theory the following equation for $I_t$: 

$$I_t \approx I \exp[-i^{*2}/4zs^*t] = I \exp[-\tau/t], \qquad (14)$$

where $I$ is the steady nucleation frequency. KANTROWITZ takes into 
account the possible departure of the accommodation coefficient from 
unity and the difference in temperature between the embryos and their 
surroundings due to the heat liberated by the condensation. With 
these modifications, $zs^*$ in equation (14) is replaced by a function of 
the appropriate accommodation coefficients and heat content quantities. 
Without making the corrections introduced by KANTROWITZ the order 
of time $\tau$ required for the transient in the nucleation of liquid 
droplets to expire is a microsecond. 
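
The approach to the steady frequency implied by equation (14) can be illustrated with the following Python sketch. It assumes the form $I_t = I \exp[-\tau/t]$ and takes $\tau$ as one microsecond, the order of magnitude quoted above; the function name is hypothetical.

```python
import math


def transient_frequency(t, I_steady, tau):
    """Equation (14): I_t = I * exp(-tau / t); taken as zero at t = 0."""
    if t <= 0.0:
        return 0.0
    return I_steady * math.exp(-tau / t)


tau = 1.0e-6   # seconds, the order of magnitude quoted for liquid droplets
# Fraction of the steady frequency reached after 1, 10 and 100 transient periods.
fractions = [transient_frequency(k * tau, 1.0, tau) for k in (1, 10, 100)]
```

On this form the frequency is $1/e$ of its steady value at $t = \tau$ and within about one per cent of it at $t = 100\tau$.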

Formal Theory of Heterogeneous Nucleation It is probable that in 
most phase transformations the formation of nuclei is catalyzed by 
various foreign surfaces present in the system. In experiments designed 
to determine the rate of nucleation, the most difficult problem is the 
elimination of these surfaces or the specification of the nature of the 
substances effecting nucleation catalysis. 

VOLMER 19 started with the basic assumptions of the V.W.B.D. theory 
and developed a formal theory for heterogeneous nucleation of liquid 
from supersaturated vapour. Suppose that a liquid embryo comes to 
equilibrium on a plane foreign surface with a contact angle $\theta$; it 
follows that: 

$$\sigma_{SV} = \sigma_{LS} + \sigma_{LV}\cos\theta, \qquad (15)$$

where $\sigma_{SV}$ is the interfacial energy per area between vapour and 
catalyst, $\sigma_{LS}$ is the interfacial energy per area between liquid and 
catalyst, and $\sigma_{LV}$ is the interfacial energy per area between liquid 
and vapour and is identical with $\sigma$ in the theory of homogeneous 
nucleation. The equilibrium form of the embryo is a spherical cap and the 
free energy of formation is: 

$$\Delta G_s = [\pi r^2\sigma_{LV} + (\pi r^3/3)\Delta G_v][2 - 3\cos\theta + \cos^3\theta], \qquad (16)$$

where $r$ is the radius of the sphere. For all $\theta > 0$, $\Delta G_s$ 
goes through a maximum having the $\Delta G_s$ co-ordinate 

$$\Delta G_s^* = \Delta G^* f(\theta), \qquad (17)$$

where $f(\theta) = (2 + \cos\theta)(1 - \cos\theta)^2/4$, and $\Delta G^*$ is 
given by equation (5). We may suppose that the embryos are in equilibrium 
with atoms adsorbed on the catalyst surface from the condensing vapour. If 
the number of adsorbed vapour atoms is $n_s$/area, the equilibrium number 
of embryos per area of a given size on the catalyst surface, $n_{s,i}$, is 

$$n_{s,i} = n_s \exp[-\Delta G f(\theta)/kT]. \qquad (18)$$

Following a development analogous to that in deriving equation (11) 
we find approximately for the nucleation frequency per area of catalyst 
surface, $I_s$, the following 20: 

$$I_s = z n_s s^* (-b\,\Delta G_v/2\pi i^* kT)^{1/2} \exp[-\Delta G^* f(\theta)/kT]. \qquad (19)$$

Consider a unit volume of vapour containing a unit surface of a 
catalyst characterized by a single contact angle $\theta$. Since 
$n_s \approx 10^{-8} n$, it follows from equations (11) and (19) that for 
this system, 

$$\log_{10}(I_s/I) = -8 + (\Delta G^*/2.3kT)[1 - f(\theta)]. \qquad (20)$$

$I$ is perceptible when $(\Delta G^*/2.3kT) \approx 25$; therefore, $I_s$ is 
of the same magnitude or greater than $I$ for all $f(\theta) < 2/3$, 
corresponding to $\theta < 102°$. When $\theta = 0$, $I_s$ can be 
perceptible with an infinitesimal degree of supersaturation. 
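
The comparison in equation (20) can be explored numerically. The Python sketch below evaluates the shape factor $f(\theta)$ of equation (17) and locates by bisection the contact angle at which $I_s$ and $I$ are equal, taking $\Delta G^*/2.3kT = 25$ as in the text. The function names are hypothetical, and the crossover found this way (a little over 100°) agrees with the quoted value only to within the rounding of $f(\theta)$ to 2/3.

```python
import math


def shape_factor(theta_deg):
    """f(theta) = (2 + cos theta)(1 - cos theta)^2 / 4, from equation (17)."""
    c = math.cos(math.radians(theta_deg))
    return (2.0 + c) * (1.0 - c) ** 2 / 4.0


def log10_rate_ratio(theta_deg, barrier_over_2p3kT=25.0):
    """Equation (20): log10(I_s / I) for unit catalyst area per unit volume."""
    return -8.0 + barrier_over_2p3kT * (1.0 - shape_factor(theta_deg))


def crossover_angle(barrier_over_2p3kT=25.0):
    """Contact angle (degrees) at which I_s = I; the ratio falls monotonically with theta."""
    lo, hi = 0.0, 180.0
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if log10_rate_ratio(mid, barrier_over_2p3kT) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


theta_cross = crossover_angle()
```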

Comparison of Theory and Experiment Nucleation of liquid phases is 
catalyzed by a large variety of surfaces, and in most vapour-liquid 
transitions the supersaturation is relieved by diffusion of vapour to 
liquid heterogeneously nucleated on these surfaces. In order to obtain 
sufficient supersaturation to cause a perceptible rate of homogeneous 
nucleation the vapour is freed, as far as possible, from suspended 
particles and it is rapidly chilled by adiabatic expansion so that the 
supersaturation ratio $P/P_0$ † builds up rapidly. With these conditions 
the value of $P/P_0$ required for perceptible homogeneous nucleation can 
be reached before an appreciable amount of vapour has diffused to 
surfaces on which liquid has been heterogeneously nucleated. 

In experiments on nucleation of liquid droplets 21 in supersaturated 
vapours $P/P_0$ is progressively increased in a series of different 
adiabatic expansions until a fog of liquid droplets is seen to form in the 
body of the vapour. The significant datum obtained from the experiment is 
the minimum value of $(P/P_0)_m$ necessary to cause this fog formation. 
Experience indicates that the fog appears in these experiments at a 
fairly sharply defined and reproducible supersaturation ratio. This 
result is in agreement with the prediction of the V.W.B.D. theory that 
$I$ increases sharply with increasing $P/P_0$, e.g. an increase of 2 per 
cent in $P/P_0$ may cause a tenfold increase in $I$. Because of this sharp 
dependence of the rate of nucleation on the supersaturation ratio, a 
large error in assigning a value to $I$ corresponding to $(P/P_0)_m$ has 
little effect on the value of $\sigma$ calculated from the nucleation data, 
i.e. the exponential term dominates equation (11). Therefore, in the 
VOLMER-FLOOD type of experiment, $(P/P_0)_m$ is measured accurately and 
$I$ is estimated within wide limits of uncertainty, within a factor of 100. 

† $P$ is the vapour pressure and $P_0$ is the vapour pressure in 
equilibrium with a flat surface of the corresponding liquid. 

VOLMER and FLOOD 21 have measured $(P/P_0)_m$ for various temperatures 
for water and some organic compounds. For an ideal vapour, 

$$\Delta G_v = -(RT/V)\ln(P/P_0), \qquad (21)$$

where $V$ is the volume per mole of the liquid phase. VOLMER and FLOOD 21 
compared their measured values of $\ln(P/P_0)_m$ with values calculated 
from equation (11), where $\Delta G^*$ is given by (5), $\Delta G_v$ by (21), 
and assuming that $\sigma$ is equal to $\sigma_0$. 

Alternatively, $\sigma$ may be calculated from (11), by substitution of the 
measured value of $(P/P_0)_m$, and compared with $\sigma_0$. The comparison 
of $\sigma$ with $\sigma_0$, the measured liquid-vapour interfacial energy 
for infinite radius of curvature, is shown in Table 1. 
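
The sharpness of the dependence of $I$ on $P/P_0$ can be illustrated with a Python sketch that combines equations (5), (6), (10) and (21) in CGS units. Several things here are assumptions rather than data from the original experiments: the simpler VOLMER-WEBER form (10) is used in place of (11), the equilibrium vapour pressure `P0` of water near 275 K is an approximate assumed value, "perceptible" is taken to mean one nucleus per cm³ per second, and all function names are hypothetical.

```python
import math

K_B = 1.380649e-16    # Boltzmann constant, erg/K
N_A = 6.02214076e23   # Avogadro number, 1/mol


def volmer_weber_rate(S, T, sigma, molar_volume, molar_mass, P0):
    """Equation (10) in CGS units, with dG_v from equation (21); S = P/P0 > 1."""
    kT = K_B * T
    v = molar_volume / N_A                     # liquid volume per molecule, cm^3
    m = molar_mass / N_A                       # mass per molecule, g
    dG_v = -(kT / v) * math.log(S)             # (21) per unit volume; RT/V equals kT/v
    barrier = 16.0 * math.pi * sigma ** 3 / (3.0 * dG_v ** 2)   # (5)
    r_star = -2.0 * sigma / dG_v               # (6)
    s_star = 4.0 * math.pi * r_star ** 2       # surface area of the nucleus
    P = S * P0
    z = P / math.sqrt(2.0 * math.pi * m * kT)  # collisions per area per time
    n = P / kT                                 # single molecules per volume (ideal vapour)
    return z * s_star * n * math.exp(-barrier / kT)


def critical_supersaturation(I_target, **vapour):
    """Bisect on ln S for the supersaturation ratio at which the rate reaches I_target."""
    lo, hi = math.log(1.01), math.log(50.0)
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if volmer_weber_rate(math.exp(mid), **vapour) < I_target:
            lo = mid
        else:
            hi = mid
    return math.exp(0.5 * (lo + hi))


# Illustrative inputs for water near 275 K (sigma_0 as in Table 1; P0 is an assumed value).
water = dict(T=275.2, sigma=75.2, molar_volume=18.0, molar_mass=18.015, P0=7.05e3)
S_m = critical_supersaturation(1.0, **water)
gain = volmer_weber_rate(1.02 * S_m, **water) / volmer_weber_rate(S_m, **water)
```

With these inputs `gain`, the factor by which the rate rises for a 2 per cent increase in $P/P_0$, comes out at several-fold, of the same order as the tenfold figure quoted in the text. This is why a crude estimate of $I$ suffices when $\sigma$ is extracted from $(P/P_0)_m$.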

TABLE 1 

Comparison of measured liquid-vapour interfacial energies with values 
calculated from VOLMER-FLOOD experiments. 
After TURNBULL and HOLLOMON 1a 

                                          $\sigma$ 
Substance             Temperature   Calculated from       Measured 
                      K             VOLMER-FLOOD data 

Water                 275.2         75.5                  75.2 
Methyl Alcohol        270.0         37.5                  24.8 
Ethyl Alcohol         273.0         24.4                  24.0 
n-Propyl Alcohol      270.0         26.2                  25.4 
iso-Propyl Alcohol    265.0         23.6                  23.1 
n-Butyl Alcohol       270.0         26.2                  26.1 
Nitromethane          252.0         41.1                  40.6 

With the single exception of the result for methyl alcohol, where the 
deviation is 50 per cent, $\sigma$ and $\sigma_0$ agree within 2-3 per cent 
and well within the experimental uncertainty. From these results it is 
concluded that the V.W.B.D. theory describes the frequency of homogeneous 
nucleation of liquid droplets in pure vapours to within the limits of 
accuracy attained in these experiments. The diameter of the nucleus 
calculated from the VOLMER-FLOOD data is of the order of 5 molecular 
diameters and certainly of the same order of magnitude as the $\delta$ of 
the TOLMAN and KIRKWOOD theories. The data are in disagreement with 
the prediction of BUFF'S modified theory that $\sigma \approx 0.8\sigma_0$ 
for the experimental conditions of VOLMER and FLOOD. 21 
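
The agreement claimed for Table 1 can be verified by simple arithmetic. The Python sketch below recomputes the percentage deviation of the calculated from the measured interfacial energy for each substance, using the values as they appear in Table 1; the variable and function names are hypothetical.

```python
# (substance, temperature in K, sigma calculated, sigma measured), as read from Table 1.
TABLE_1 = [
    ("Water", 275.2, 75.5, 75.2),
    ("Methyl Alcohol", 270.0, 37.5, 24.8),
    ("Ethyl Alcohol", 273.0, 24.4, 24.0),
    ("n-Propyl Alcohol", 270.0, 26.2, 25.4),
    ("iso-Propyl Alcohol", 265.0, 23.6, 23.1),
    ("n-Butyl Alcohol", 270.0, 26.2, 26.1),
    ("Nitromethane", 252.0, 41.1, 40.6),
]


def percent_deviation(calculated, measured):
    """Deviation of the calculated value from the measured one, in per cent."""
    return 100.0 * (calculated - measured) / measured


deviations = {name: percent_deviation(calc, meas) for name, _, calc, meas in TABLE_1}
largest_other = max(dev for name, dev in deviations.items() if name != "Methyl Alcohol")
```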

SANDER and DAMKÖHLER 22 have measured the rate of nucleation of 
water droplets in supersaturated water vapour at lower temperatures 
than those attained by VOLMER and FLOOD. Their calculated values of 
$\sigma$ are about 10 per cent less than $\sigma_0$ at the lowest 
temperature of their measurements. Further, SANDER and DAMKÖHLER'S 
$\sigma$ appears to go through a maximum with decreasing temperature in 
qualitative agreement with BUFF'S theory. However, in the temperature 
range of overlapping measurements SANDER and DAMKÖHLER'S $\sigma$ is 
about 7 per cent less than VOLMER and FLOOD'S. Recently POUND and 
MADONNA 23 have repeated the experiments on nucleation of water 
including in their measurements the temperature range covered by VOLMER 
and FLOOD and SANDER and DAMKÖHLER. POUND and MADONNA'S values of 
$\sigma$ are in good agreement with those of VOLMER and FLOOD and with 
$\sigma_0$ down to a nucleus containing about 70 molecules. For 
$i^* < 70$, $\sigma_0 > \sigma$ and at $i^* \approx 50$, 
$\sigma \approx 0.9\sigma_0$. 

WEGENER and SMELT 24 found that the degree of supersaturation 
reached in the very rapid expansion of moist air is much greater than 
would seem to be permitted by the V.W.B.D. theory. KANTROWITZ 
suggested that the supersaturation in WEGENER and SMELT'S experiments 
reaches a very high level because of the time lag in reaching the 
steady state value of $I$ predicted by the V.W.B.D. theory. KANTROWITZ 
showed that these results are precisely described by the approximate 
theory of transient nucleation with a single disposable parameter, 
the accommodation coefficient, $\alpha$, of water vapour molecules on a 
liquid water surface. However, the value of 
$\alpha \approx 5 \times 10^{-4}$ required for the agreement seems too low. 

It appears, therefore, that the kinetics of nucleation of liquid droplets 
in supersaturated vapours is adequately described by the V.W.B.D. 
theory and it seems permissible to ignore the effect of droplet radius on 
its surface energy. The success of the V.W.B.D. theory encourages us 
to explore the consequences of the use of analogous assumptions in 
extending the theory and in describing the nucleation of crystalline 
phases. 

Theory of Nucleation in Condensed Systems 

Most of the theories of homogeneous nucleation in condensed systems 
start with the basic assumptions of the V.W.B.D. theory. Thus the 
free energy for formation of embryos in condensed systems is assumed 
to be given by an equation identical in form with equation (2) with the 
appropriate $\sigma$ and $\Delta G_v$ values and values of $a$ and $b$ 
consistent with an assumed shape of the nucleus. $\Delta G^*$ is given by 

$$\Delta G^* = K\sigma^3/(\Delta G_v)^2, \qquad (22)$$

where $K$ is determined by the nucleus shape and is $16\pi/3$ for a 
spherical nucleus. 

For nucleation in condensed systems the kinetic theory expression, z, 
for the rate of bombardment of the nucleus by molecules of the parent 
phase can, in general, no longer be used. The movement of an atom 
across an interface between two condensed phases usually requires a 
free energy of activation, $\Delta G_A$. On this basis BECKER 25 derived a 
general expression for the rate of nucleation in condensed systems. 

Recently TURNBULL and FISHER, 26 again following the basic assumptions 
of the V.W.B.D. theory, applied the absolute reaction rate formalism to 
nucleation in condensed systems and derived a more quantitative 
expression for $I$. In terms of the absolute rate theory the frequency, 
$\nu$, for transfer of an atom across an interface is: 

$$\nu = (kT/h)\exp[-\Delta G_A/kT], \qquad (23)$$

where $h$ is Planck's constant. Following the methods of BECKER and 
DÖRING they derived an expression for the rate of nucleation: 

$$I = K_v \exp[-(\Delta G^* + \Delta G_A)/kT], \qquad (24)$$

where $K_v = n^*(a\sigma/9\pi kT)^{1/2}\, n\,(kT/h)$, $a$ (see equation (2)) 
is determined by the nucleus shape, $n^*$ is the number of atoms in the 
surface of a nucleus of critical size, and $n$ is the number of atoms per 
volume of the parent phase. 

For spherically shaped nuclei, 

$$(a\sigma/9\pi kT)^{1/2} = (\sigma/kT)^{1/2}(2v/9\pi)^{1/3}, \qquad (25)$$

where $v$ is volume per atom of the parent phase. 

Following the treatment in the preceding section we may write for 
the transient rate of nucleation in condensed systems: 

$$I_t \approx I \exp[-i^{*2}/4n^*\nu t], \qquad (26)$$

where $\nu$ is given by equation (23). Since $\nu$ can be very small the 
transient period, $\tau \approx i^{*2}/4n^*\nu$, can be much longer for 
condensed systems 16 than for the nucleation of liquid from vapour. 
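
The effect of the activation free energy on the transient period can be sketched in Python from equations (23), (24) and (26). The numerical inputs (temperature, $i^*$, $n^*$ and $\Delta G_A$) are purely illustrative assumptions, SI units are used, the prefactor $K_v$ is left to the caller, and the function names are hypothetical.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
H = 6.62607015e-34    # Planck constant, J s


def jump_frequency(T, dG_A):
    """Equation (23): nu = (kT/h) exp(-dG_A/kT)."""
    return (K_B * T / H) * math.exp(-dG_A / (K_B * T))


def nucleation_rate(T, K_v, barrier, dG_A):
    """Equation (24): I = K_v exp[-(dG* + dG_A)/kT], with K_v supplied by the caller."""
    return K_v * math.exp(-(barrier + dG_A) / (K_B * T))


def transient_period(i_star, n_star, nu):
    """Transient period tau ~ i*^2 / (4 n* nu), from equation (26)."""
    return i_star ** 2 / (4.0 * n_star * nu)


# Illustrative comparison at 600 K: no activation barrier versus about 1 eV per atom.
tau_free = transient_period(200, 100, jump_frequency(600.0, 0.0))
tau_activated = transient_period(200, 100, jump_frequency(600.0, 1.6e-19))
```

The ratio `tau_activated / tau_free` equals $\exp[\Delta G_A/kT]$, which shows how a modest activation energy lengthens the transient by many orders of magnitude.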

The temperature coefficient of $I$, $dI/dT$, is found, by differentiation 
of equation (24), to be: 

$$dI/dT = I\{(\Delta G^* + \Delta G_A)/kT^2 - (1/kT)(d[\Delta G^*]/dT)\}, \qquad (27)$$

where $\Delta G_A$ is assumed to be independent of $T$. $I$ goes through a 
maximum $I = I_{max}$ at $T = T_{max}$. $T_{max}$ is found by solving the 
equation 

$$(\Delta G^* + \Delta G_A)/T_{max} = [d(\Delta G^*)/dT]_{T = T_{max}}, \qquad (28)$$

obtained by setting $dI/dT = 0$. From the thermodynamic identity, 
$(dG/dT)_P = -S$, it follows that 

$$[d(\Delta G_v)/dT]_P = -\Delta S_v, \qquad (29)$$

where $\Delta S_v$ is the entropy change per volume accompanying the phase 
transformation. Often it can be assumed to a fair approximation that 
$\Delta S_v$ is temperature independent whence it follows: 

$$\Delta G_v = \Delta S_v(T_0 - T), \qquad (30)$$

where $T_0$ is the temperature at which the two phases of infinite mass 
are in equilibrium. 

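Substitution of equations (22) and (30) into (28) gives a general 
relation: 

$$(T_{max} - T_0)^3 + K\sigma^3(T_{max} - T_0)/\Delta G_A(\Delta S_v)^2 + 2K\sigma^3 T_{max}/\Delta G_A(\Delta S_v)^2 = 0, \qquad (31)$$

that can be solved for $T_{max}$. When $\Delta G_A = 0$, 
$T_{max} = T_0/3$. If $\Delta G_A > 0$, $T_{max} > T_0/3$. 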
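
The position of the maximum can be found numerically. The Python sketch below assumes, as in equations (22) and (30), that $\Delta G^* = K\sigma^3/[\Delta S_v(T_0 - T)]^2$ with $\sigma$, $\Delta S_v$ and $\Delta G_A$ independent of temperature, so that the condition (28) reduces to $\Delta G_A(\Delta S_v)^2(T_0 - T)^3 + K\sigma^3(T_0 - 3T) = 0$. The input values are illustrative SI numbers, not data from the text, and the function names are hypothetical.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
K_SPHERE = 16.0 * math.pi / 3.0


def log_rate_exponent(T, T0, sigma, dS_v, dG_A, K=K_SPHERE):
    """Exponent of equation (24), -(dG* + dG_A)/kT, with dG* from (22) and (30)."""
    barrier = K * sigma ** 3 / (dS_v * (T0 - T)) ** 2
    return -(barrier + dG_A) / (K_B * T)


def t_max(T0, sigma, dS_v, dG_A, K=K_SPHERE):
    """Bisect dG_A*dS_v^2*(T0 - T)^3 + K*sigma^3*(T0 - 3T) = 0 on [T0/3, T0]."""
    def g(T):
        return dG_A * dS_v ** 2 * (T0 - T) ** 3 + K * sigma ** 3 * (T0 - 3.0 * T)

    lo, hi = T0 / 3.0, T0          # g >= 0 at T0/3, g < 0 at T0, and g falls monotonically
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Illustrative inputs: T0 = 900 K, sigma = 0.2 J/m^2, dS_v = 1e6 J/(m^3 K).
T_no_barrier = t_max(900.0, 0.2, 1.0e6, 0.0)
T_with_barrier = t_max(900.0, 0.2, 1.0e6, 1.0e-19)
```

With no activation energy the root is $T_0/3$; with $\Delta G_A > 0$ it moves to a higher temperature, since the factor $\exp[-\Delta G_A/kT]$ penalizes low temperatures.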



The formal treatment of VOLMER for nucleation of liquid from vapour 
on catalytic surfaces has been extended to the nucleation of crystalline 
phases in condensed systems. 27 Consider, for example, the nucleation 
of a crystal in a supercooled liquid. If it is assumed that the 
liquid-crystal interfacial energy $\sigma_{LC}$ is isotropic, equations 
(15) and (16) apply with the subscripts SL, SC and LC replacing SV, LS and 
LV, respectively, where S refers to catalyst, L to liquid and C to the 
forming crystal having the same composition as the liquid. By a simple 
extension of the treatment of TURNBULL and FISHER 26 the following 
expression for $I_s$ for crystalline phases in condensed systems is 
obtained 28: 

$$I_s = K_s \exp[-\Delta G^* f(\theta)/kT], \qquad (32)$$

where for nuclei shaped like spherical caps, 

$$K_s = n^*(\sigma/kT)^{1/2}(2v/9\pi)^{1/3}[f(\theta)]^{1/6}\, n_s\,(kT/h)\exp[-\Delta G_A/kT], \qquad (33)$$

and $\sigma$ is the interfacial energy per area between forming crystal 
and the parent phase. 

As has been pointed out earlier 27 transients are as likely to be 
important in heterogeneous as in homogeneous nucleation. With $I_s$ 
substituted for $I$ and using the appropriate values of $i^*$ and $n^*$, 
equation (26) gives an approximate relation for the transient in 
heterogeneous nucleation. 

Let a phase $\alpha$ be in equilibrium with a phase $\beta$ having the 
same composition at temperature $T_0$. Below $T_0$, $\beta$ is more stable 
and above $T_0$, $\alpha$ is more stable. Suppose $\beta$ is heated to a 
temperature in the range of $\alpha$ stability. It follows that at this 
temperature $\beta$ is less stable than $\alpha$ on all plane foreign 
surfaces for which $\theta > 0$, where $\theta$ is the contact angle 
between $\beta$ in contact with $\alpha$ and the foreign surface. 

However, VOLMER 29 has shown that material held in microcavities 
of suitable geometry in the foreign surface can be more stable as $\beta$ 
than as $\alpha$, even in the temperature range of $\alpha$ stability, for 
$0 < \theta < 90°$. The basic reason is that for $\theta < 90°$ there is a 
net gain in surface energy for that part of the $\beta$ surface that forms 
on the surface of a catalyst (see equation (15)). Therefore, any geometry 
of the $\beta$ form that maximizes the fraction $a_s$ of the $\beta$ 
surface forming on the catalyst surface and minimizes the fraction 
$a_\alpha$ in contact with the parent $\alpha$ will favour the 
stabilization of $\beta$. Evidently negative curvature in the catalyst 
surface increases $a_s/a_\alpha$. 

A theory for the thermal history phenomenon in phase transformations, 
based on the concept of retention of $\beta$ in microcavities, has been 
derived. 27 As the temperature is increased above $T_0$, the size of the 
microcavities in which $\beta$ is retained is decreased and at very high 
temperatures becomes of the order of molecular dimensions. After an 
element of material in a microcavity transforms from $\beta$ to $\alpha$ 
it cannot revert to $\beta$ until the temperature is lowered sufficiently 
for the rate of nucleation of $\beta$ on a flat surface to be appreciable. 

Fig. 4. Effect of superheating on the supercooling required to cause 
nucleation of embryos retained in cylindrical cavities 27 

Elements of $\beta$ retained in microcavities nucleate the transformation 
from $\alpha$ to $\beta$. In order to serve as a nucleus the radius of the 
$\beta$ element, and hence the microcavity in which it is retained, must 
equal or exceed the critical radius $r_c^*$ for nucleation of $\beta$ on a 
flat surface. Since $r_c^*$ decreases with decreasing temperature it 
follows that the supercooling, $\Delta T_-$, will be less the larger is 
the maximum size, $r_m$, of the microcavity in which $\beta$ is retained. 
Since $r_m$ decreases with increasing $\Delta T_+$, where 
$\Delta T_+ = T_m - T_0$, and $T_m$ is the maximum temperature attained in 
the range of $\alpha$ stability, it follows that $\Delta T_-$ should 
increase with $\Delta T_+$. On the basis of these concepts and assuming 
cylindrical microcavities the undercooling should be related to the degree 
of superheating by the following equation 27: 

$$\tan\theta = \Delta T_-/\Delta T_+. \qquad (34)$$

Fig. 4 shows $\Delta T_-$ as a function of $\Delta T_+$ for a particular 
value of $\theta$. 

It is evident that equation (34) ceases to have meaning when $\Delta T_-$ 
exceeds the supercooling $\Delta T_-^*$ required for a perceptible rate of 
nucleation on a flat element of the catalyst surface. $\Delta T_-$ is a 
linear function of $\Delta T_+$ for $\Delta T_- < \Delta T_-^*$ but is 
independent of $\Delta T_+$ when $\Delta T_- > \Delta T_-^*$. 
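
The behaviour summarized in Fig. 4 can be written as a two-branch function. The Python sketch below assumes equation (34) in the form $\Delta T_- = \Delta T_+ \tan\theta$ for cylindrical cavities with $0 < \theta < 90°$, capped at the flat-surface supercooling $\Delta T_-^*$; the function name and the numbers in the example are hypothetical.

```python
import math


def required_supercooling(superheat, theta_deg, flat_surface_supercooling):
    """Supercooling needed after a given superheat, for cylindrical microcavities."""
    # Linear branch, equation (34): retained embryos in cavities act as nuclei.
    cavity_limited = math.tan(math.radians(theta_deg)) * superheat
    # Plateau: nucleation on a flat element of the catalyst surface takes over.
    return min(cavity_limited, flat_surface_supercooling)


# Illustrative curve for theta = 30 degrees and a flat-surface supercooling of 20 degrees.
curve = [required_supercooling(dT, 30.0, 20.0) for dT in (0.0, 10.0, 20.0, 40.0, 80.0)]
```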

With suitable microcavities in its surface a substance characterized 
by a comparatively large value of $\theta$ may be as effective in nucleation 
catalysis as another substance characterized by a very small value of 6. 
For example, it is doubtful that substances, such as gallium, having very 
complex crystal structures ever crystallize from a liquid without very 
marked undercooling unless nucleation is effected by retained nuclei. 

In heterogeneously nucleated transformations it is possible for the 
preferred nucleation sites to be exhausted. AVRAMI 30 (now MELVIN) 
recognized this possibility and developed a formal theory to describe it. 

Suppose there are $N_0$ preferred nucleation sites per volume in the 
system. These sites might be occupied by retained nuclei or by catalyst 
particles so small that the whole surface of a particular particle will be 
engulfed by $\beta$ growing from the first $\beta$ nucleus that forms on 
its surface. If there is equal probability that a nucleus will form at any 
of the preferred sites the number $N$ per untransformed volume that have 
not nucleated in time $t$ is: 

$$N = N_0 \exp[-ft], \qquad (35)$$

where $f$ is the nucleation frequency/site. 

The nucleation frequency/volume is found by differentiation of (35) 
to be: 

$$I_t = N_0 f \exp[-ft]. \qquad (36)$$

Fig. 5. Frequency of heterogeneous nucleation as a function of time 
showing initial build up toward steady frequency and final falling off 
because of exhaustion of nucleation sites 27 

AVRAMI did not consider the possibility of an initial transient and 
his theory requires that $I_t$ always decrease with time. When the initial 
transient in $I_t$ is taken into account a more general relation: 

$$I_t = N_0 f \exp[-(ft + i^{*2}/4n^*\nu t)], \qquad (37)$$

in heterogeneous nucleation is obtained that is indicated graphically 
in Fig. 5. $I_t$ increases from a value of zero to a steady value and then 
declines as the nucleation sites are exhausted. It is evident that the 
entire system may be transformed to $\beta$ by growth of transient $\beta$ 
nuclei before the effects of exhaustion have become apparent. 
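
The rise and fall shown in Fig. 5 follows directly from equation (37). In the Python sketch below the transient period is written $\tau = i^{*2}/4n^*\nu$, so that $I_t = N_0 f \exp[-(ft + \tau/t)]$; differentiating the exponent shows that the frequency peaks at $t = (\tau/f)^{1/2}$. The function names and the numbers used are illustrative assumptions.

```python
import math


def site_limited_frequency(t, N0, f, tau):
    """Equation (37): I_t = N0 * f * exp[-(f*t + tau/t)], with tau = i*^2 / (4 n* nu)."""
    if t <= 0.0:
        return 0.0
    return N0 * f * math.exp(-(f * t + tau / t))


def time_of_peak(f, tau):
    """Time at which d/dt of (f*t + tau/t) vanishes, i.e. where I_t is greatest."""
    return math.sqrt(tau / f)


# Illustrative values in arbitrary consistent units.
N0, f, tau = 1.0e6, 0.01, 4.0
t_peak = time_of_peak(f, tau)
peak_value = site_limited_frequency(t_peak, N0, f, tau)
```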

Nucleation Phenomena in Condensed Systems 

Fracture of Liquids The breaking of liquids subjected to hydrostatic 
tension takes place by the nucleation and growth of bubbles. Consider 
the formation of a spherical cavity in the body of a liquid subjected to a 
hydrostatic tensile stress, $P$ (negative pressure). Nucleation of these 
cavities can be described by the theory of nucleation in condensed 
systems. If $P \gg P_0$, the vapour pressure of the liquid, 
$\Delta G_v = P$ and $\Delta G^* = 16\pi\sigma^3/3P^2$, where $\sigma$ is 
the liquid-vapour interfacial energy. Following FISHER 31 the frequency of 
bubble nucleation is: 

$$I = n(kT/h)\exp[-\Delta G_A/kT]\exp[-16\pi\sigma^3/3kTP^2], \qquad (38)$$

where $n$ is the number of atoms per volume and $\Delta G_A$, the free 
energy of moving an atom out of the liquid-vapour interface into the 
liquid, may be set equal to the free energy per atom for viscous flow. 
This formal treatment applies only to relatively fluid liquids. FISHER 31 
has shown 
that in very viscous liquids such as glasses, fracture by the nucleation 
and growth of thin disc-like cavities is a much more probable occurrence 
than fracture by the nucleation and growth of spherical cavities. 

Liquids usually prefer to pull apart at interfaces already present in 
the system rather than by homogeneous nucleation of cavities within 
the bulk of the liquid. Practically, it is very difficult to attain the 
negative pressure necessary to cause homogeneous nucleation of 
cavities before the pressure is relieved by the growth of a bubble that is 
nucleated heterogeneously at an existing interface. In order to mini- 
mize the likelihood of heterogeneous nucleation of cavities, foreign 
particles should be removed from the liquid and a container having walls 
that are strongly "wet" by the liquid should be selected. 

TABLE 2 

Comparison of Theoretical and Measured Fracture Stresses of Various 
Liquids. After FISHER 31 and BERNATH 32 

                      Fracture Pressure, Atmospheres 
Liquid                $-P_{calc}$       $-P_{exp}$ 

Water                 1320              350 33 
Ethyl Alcohol         223               39 34 
Ethyl Ether           138               72 34 
Benzene               338               152 35 
Acetic Acid           322               288 35 


The criterion for the fracture of a liquid subjected to a negative 
pressure is that the rate of nucleation of cavities ($I$) must have some 
perceptible value (such as 1/mole-second). In order to test the 
predictions of nucleation theory, negative pressures, $P_{calc}$, 
corresponding to $I = 1$ mole$^{-1}$ sec$^{-1}$ calculated from equation (38) are 
compared with values measured experimentally, $P_{exp}$. In this 
comparison only the maximum negative pressures that have been 
reproduced experimentally are pertinent since it is presumed that the 
pressure required for homogeneous nucleation must be equal to or 
greater than the maximum negative pressure that is observed. The 
comparison of $P_{calc}$ and $P_{exp}$ for various fluids is shown in Table 2. 
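
The calculated entries of Table 2 can be reproduced approximately by 
solving equation (38) for $P$ at a chosen nucleation frequency. The 
following Python sketch is illustrative only. It assumes that $n$ is 
Avogadro's number (so that $I$ is per mole), that the activation term 
$\Delta G_A$ is negligible, and that $\sigma$ = 72 ergs/cm$^2$ for water at 300 K; none 
of these inputs is taken from the text, and the function name is 
hypothetical. 

```python
import math

K_BOLTZMANN = 1.380649e-16    # erg/K
H_PLANCK = 6.62607015e-27     # erg s
N_AVOGADRO = 6.02214076e23    # atoms per mole
DYN_PER_CM2_PER_ATM = 1.01325e6


def fracture_pressure_atm(sigma, temperature, rate=1.0, delta_g_a=0.0):
    """Solve equation (38) for the tensile stress P, in atmospheres.

    sigma       -- liquid-vapour interfacial energy, erg/cm^2 (assumed input)
    temperature -- absolute temperature, K
    rate        -- nucleation frequency I, per mole per second
    delta_g_a   -- activation free energy per atom, erg (assumed zero here)
    """
    kt = K_BOLTZMANN * temperature
    # ln[n (kT/h) exp(-dG_A/kT) / I], with n taken as one mole of atoms
    log_term = math.log(N_AVOGADRO * kt / H_PLANCK / rate) - delta_g_a / kt
    p_squared = 16.0 * math.pi * sigma**3 / (3.0 * kt * log_term)
    return math.sqrt(p_squared) / DYN_PER_CM2_PER_ATM


if __name__ == "__main__":
    print(round(fracture_pressure_atm(72.0, 300.0)))
```

With these assumed inputs the result is of the order of 1300 
atmospheres, comparable with the calculated value listed for water in 
Table 2. Because $I$ enters only through a logarithm, the chosen 
"perceptible" rate has little influence on $P_{calc}$. 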

Although the maximum negative pressure is nearly as large as the 
calculated pressure for acetic acid, in all other cases it is less by a factor 
of 2 to 6. It is probable that cavities were nucleated heterogeneously 
in all the latter experiments. 

The heterogeneous nucleation of cavities takes place according to the 
general theory already described. 

Crystallization of Supercooled Liquids The supercooling of liquids was 
observed almost as soon as temperature began to be measured quantita- 
tively. FAHRENHEIT (around the year 1725) probably made the first 
systematic study of supercooling. He discovered that water readily 
supercooled several degrees before beginning to crystallize and that 
different samples of water consistently behaved differently. 
Fortunately, from the standpoint of using the ice-water equilibrium 
temperature as a fixed point in thermometry, ice crystals grow very 
rapidly so that, though different samples of water start to crystallize at 
widely different temperatures, the rapid release of heat associated with 
growing ice crystals quickly raises their temperature to very nearly the 
equilibrium temperature. 

According to VOLMER'S 29 historical account it was realized more 
than one hundred years ago that the crystallization of supercooled 
liquids or supersaturated solutions is usually nucleated heterogeneously 
by foreign bodies. These bodies are usually present accidentally and in 
too small an amount to permit chemical identification. 

The evidence indicating that crystal nucleation in supercooled 
liquids is usually induced by accidental impurities has been summarized 
in many earlier publications 30, 27, 2. Perhaps the most important 
evidence is the specificity of the behaviour of different samples of the 
same liquid first observed by FAHRENHEIT and the reduced nucleation 
frequency observed after a liquid has been passed through a fine filter. 

The problem of minimizing the effect of adventitious nucleation 
catalysts in crystallization has been a baffling one and an apparently 
workable solution was not found until recently. 

For more than seventy years it has been known that small metal 
droplets supercool several tens of degrees before solidification. The 
most systematic early study of the phenomenon was made by INGERSOLL 
and MENDENHALL. 36 They found that 50 to 100 micron diameter 
droplets of metals such as gold, platinum and rhodium often 
supercooled over 100° when melted and cooled on a Nernst glower. There 
seemed to be a rough correlation between the maximum supercooling, 
$\Delta T$, and the melting temperature of the metal. For platinum the amount 
of supercooling was observed to be approximately 370°C. They found 
that $\Delta T$ was generally greater the smaller the droplet. Their results, 
coupled with the common knowledge that large continuous masses of 
liquids, such as gold, that crystallize to close-packed structures rarely 
supercool more than 2 or 3°, indicate a tremendous effect of specimen 
size on solidification behaviour. 

Whether INGERSOLL and MENDENHALL or any of the earlier investi- 
gators appreciated the large magnitude and importance of this size 
effect is not entirely clear from the literature. 

The recent experiments of VONNEGUT 37 on tin and of CWILONG 38 and 
SCHAEFER 39 on water again focused attention on the size effect in 
solidification. VONNEGUT studied dilatometrically the rate of 
solidification of a dispersion of small tin droplets, 1-10 microns in 
diameter, prevented from coalescing by an oxide film. In these 
experiments the period of crystal growth was negligible in comparison 
with that of nucleation. The rate of solidification was not perceptible 
until the dispersion had been supercooled of the order of 105 to 110°. 
Large continuous masses of tin have not been supercooled more than 
30°C without crystallization. 49 

CWILONG 38 and SCHAEFER 39 observed that snow crystals do not form 
at an appreciable rate in a fog of water droplets (10-20 microns in 
diameter) until a "critical" temperature 39 to 40° below the normal 
melting point of water is reached. Large continuous masses of water 
have only rarely been supercooled more than 20° without crystallization. 

These experiments clearly confirm the existence of a marked size 
effect in solidification. Thus, the frequency of nucleus formation ($k$) 
in a given specimen is a function of the volume, $v$, of the specimen 
and the degree of supercooling, i.e. $k = f(v, \Delta T)$. We may for a given 
liquid define an "isokinetic" relation as follows 28: 

$k = f(v, \Delta T) = \mathrm{constant}$ .... (39) 

With tin the magnitude of the size effect is such that the value of $v$ 
changes by a factor of the order of $10^{11}$ for a change of 80° in $\Delta T$. 

It has been demonstrated 41, 42, 43 that only the theory that 
partitioning the liquid isolates the effective catalyst particles in a small 
fraction of the droplets seems compatible with all the facts. Suppose 
that the liquid containing $m$ adventitious nucleation catalyst particles 
per volume is dispersed into $n$ droplets per volume kept from 
intercommunicating by suitable barriers. If $n \gg m$, crystallization nuclei 
in most droplets must form either homogeneously or on the surface of 
added nucleation catalysts, e.g. a crystalline film on the droplet surface. 
The probability that a particular droplet contains an accidental 
nucleation catalyst is ~ $m/n$. 

The isolation theory can be tested by comparing the isokinetic 
relation it predicts with the isokinetic relation measured experimentally. 
Consider first the very simple case where all $m$ catalyst particles have 
the same catalytic potency per area characterized by a single value of 
the contact angle, $\theta$. The isokinetic relation for droplets containing 
the average number of catalyst particles per volume is the curve cd 
drawn schematically in Fig. 6. Curve ab represents the isokinetic relation 
for homogeneous nucleation. For large values of $v$ ($v \gg 1/m$) the catalyst 
particles far outnumber the droplets and the measured isokinetic 
relation will follow curve cd. In the region $v \sim 1/m$ the crystallization 
behaviour will be very erratic because there is an average of only one 
catalyst particle per droplet. Since these particles will be distributed 
statistically some droplets will not contain any, and others will contain 
several times the average number of catalyst particles. Therefore, the 
measured $\Delta T$ will fall on the curve ab for some droplets and above or 
below the curve cd for others. For $1/m \gg v$, the average measured $\Delta T$ 
will follow the curve ab. We have made an idealized representation 
(see solid curve Fig. 6) of the predicted experimental isokinetic 
relation. Note that, in agreement with nucleation theory, the absolute 
magnitude of the slope of ab is much greater than that of cd. 

Fig. 6. Schematic isokinetic relation for solidification of liquid droplets 
that may contain only one variety of nucleation catalyst (axes: log $v$ 
and $\Delta T$; the region $v \gg 1/m$ is marked) 
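
The statement that the catalyst particles are distributed statistically 
among the droplets can be made concrete with a Poisson distribution, 
which is an assumption introduced here and not stated in the text. For 
a droplet of volume $v$ taken from a liquid holding $m$ particles per 
volume, the mean number per droplet is $mv$ and the fraction of droplets 
free of catalyst is $\exp(-mv)$. The function names and the numerical 
concentration below are hypothetical. 

```python
import math


def fraction_without_catalyst(m, v):
    """Fraction of droplets holding no catalyst particle.

    Assumes (not from the text) that particles are Poisson distributed,
    with m particles per unit volume and droplet volume v.
    """
    return math.exp(-m * v)


def fraction_with_at_least(m, v, count):
    """Fraction of droplets holding `count` or more catalyst particles."""
    mean = m * v
    below = sum(math.exp(-mean) * mean**j / math.factorial(j)
                for j in range(count))
    return 1.0 - below


if __name__ == "__main__":
    m = 1.0e6  # assumed catalyst concentration, particles per cc
    for v in (1.0e-4, 1.0e-6, 1.0e-8):  # droplet volumes, cc
        print(v, fraction_without_catalyst(m, v),
              fraction_with_at_least(m, v, 3))
```

On this assumption, for $v = 1/m$ roughly 37 per cent of the droplets 
contain no catalyst particle while about 8 per cent contain three or 
more, which corresponds to the erratic regime described above. 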

The experimental evidence 44 definitely indicates that there is rarely, 
if ever, so large a discontinuity in the $\Delta T$-log $v$ relation as is indicated 
in Fig. 6. Rather the data suggest an almost continuous increase of 
$\Delta T$ with decreasing log $v$ as shown in Fig. 7. 

To account for this almost continuous variation of $\Delta T$ with log $v$ 
two additional postulates are necessary. First, the $m$ catalyst particles 
are characterized by a variety of $\theta$ values rather than a single one. 
That is, some catalysts will be more effective than others in promoting 
solidification. Second, the concentration of the more effective nucleation 
catalysts is likely to be smaller than that of catalysts that are less 
effective. From these postulates LEVINE 45 has derived a $\Delta T$-log $v$ 
relation that successfully describes the experimental data of DORSCH 
and HACKER 44 on the freezing of water droplets. LEVINE'S theory is 
formally identical with WEIBULL'S 46 theory (see also FISHER and 
HOLLOMON 47) for the size effect in fracture. The basic postulates, due to 
GRIFFITH, 48 underlying WEIBULL'S theory are that fracture is initiated 
by structural defects and that the density of these defects per area is 
greater the lesser their ability to initiate fracture. These postulates are 
analogous with the underlying postulates of the isolation theory for the 
size effect in solidification. 

Fig. 7. General isokinetic relation for a liquid containing a wide 
variety of accidental nucleation catalysts (axis: log $v$) 

It is reasonable to suppose that a liquid may contain a wide variety 
of nucleation catalysts. This variety may be due to chemical differences 
and/or to differences in the catalytic potency of substances due to 
heterogeneous surface structure. 28 

The concentration of catalyst particles will decrease as their effective- 
ness increases only if a few substances can be potent nucleation catalysts 
for a given liquid-crystal transition but a wide variety of substances 
can be catalytic to some degree. There is experimental evidence that 
favours this postulate. 

Probably the most quantitative support for the isolation theory and 
for the theories of homogeneous and heterogeneous nucleation applied 
to solidification is provided by the results of TURNBULL 28 on the 
kinetics of solidification of dispersions of mercury droplets coated with 
various films. In these experiments the temperature range of rapid 
solidification was first measured in continuous cooling experiments. 
This temperature range is relatively insensitive to cooling rate because 
of the large temperature coefficient of the rate of nucleation. Crystal 
nucleation frequencies were deduced from isothermal rates of 
solidification measured for several of the dispersions. From the 
isothermal data and the measured droplet size distribution it was 
determined whether the nucleation frequency was proportional to the 
volume or to the surface area of the droplet. 

Fig. 8. Isokinetic relation for solidification of mercury droplets 
coated with various films. After TURNBULL 28 

Fig. 8 shows isokinetic relations for the various dispersions 
corresponding to a nucleation frequency of $10^{-4}$ sec$^{-1}$. The solid lines 
for droplets coated with mercury laurate (HgLau), mercury acetate 
(HgAc), and mercurous iodide (Hg$_2$I$_2$) films are precise. Approximate 
relations for droplets coated with mercury sulphide (HgS), mercury 
stearate (HgSt), and HgX were deduced from continuous cooling 
experiments and some isothermal measurements of uncertain 
interpretation on HgSt and HgX dispersions. Chemical evidence 
suggested that the HgX film was a mercury oxide. Also drawn in the 
figure is the assumed course of the isokinetic relation for the bulk 
mercury sample containing only the accidental catalysts. There is a 
large uncertainty in this curve, since it was inferred from only 3 points 
and the two lowest of them give only the upper limit of $\Delta T$ for the 
corresponding $v$ value. 

These results confirm two of the most important predictions of the 
isolation theory. The first is that there is no single isokinetic relation 
that describes the size effect in solidification; rather, the isokinetic 
behaviour is clearly dependent upon the nature of the film on the 
droplet surface. The second is that the slope of the isokinetic relation 
for a particular catalyst is very small. For example, the isokinetic 
relation for droplets coated with HgLau gives $\Delta T = 70°$ when 
extrapolated to $v = 1$ cc. Since 1 cc masses of mercury almost always 
start to crystallize for $\Delta T < 4°$, it is concluded that they must contain 
relatively effective catalyst particles that become isolated from most of 
the mass when it is dispersed. 

It was established that the nucleation frequency in droplets coated 
with HgLau is proportional to droplet volume and that the kinetic 
coefficient, $K_v$, that describes the data is in reasonable agreement with 
$K_v$ calculated from equation (24) predicted by the theory of 
homogeneous nucleation. Comparison of the experimental and calculated 
$K_v$ is indicated in Table 3. These results are compatible with the 
interpretation that nuclei formed homogeneously in the HgLau coated 
droplets. The temperature coefficient of the nucleation frequency gives 
the value $\sigma = 31.2$ ergs/cm$^2$ (see table) when it is assumed that the 
nuclei are spherical. 

The disagreement (a factor of $10^7$) between the experimental and 
calculated values of $K_v$ is outside the experimental error. A possible 
reason for this disagreement is that $\sigma$ and $\Delta S_v$ may not be independent 
of temperature as assumed in making the calculation. It was shown 28 
that the disagreement can be explained on the assumption of small 
and apparently reasonable magnitudes for the temperature coefficients 
of $\sigma$ and $\Delta S_v$. It is also possible to account for the disagreement on the 
concept that a crystal nucleus may derive from a structure that is 
somewhat different from the most stable macroscopic structure, 
characterized by the normal melting temperature, $T_m$. This concept is 
plausible because the structure that minimizes $\sigma$ is not necessarily the 
same structure that minimizes the Gibbs free energy of the crystal. For 
example, let the nucleus be strained an amount $\epsilon$ relative to the stable 
macroscopic structure $\alpha$. Then $\sigma$ is a function of $\epsilon$, $\sigma(\epsilon)$, and equation (1) 
for the free energy of forming the crystal nucleus is modified to: 

$\Delta G = 4\pi r^2 \sigma(\epsilon) + \tfrac{4}{3}\pi r^3 (\Delta G_v + c\epsilon^2)$ .... (40) 



where $c$ is the appropriate elastic coefficient. The macroscopic structure, 
$\beta$, from which the nucleus derives will be characterized by a melting 
temperature, $T_m'$, which is lower than $T_m$ and is found by solving 
the equation $\Delta G_v = \Delta S_v \Delta T = c\epsilon^2$. The free energy of forming the 
nucleus is: 

$\Delta G^* = f(r^*, \epsilon^*)$ .... (41) 

where $r^*$ and $\epsilon^*$ are found by solving simultaneously the equations: 

$[\partial(\Delta G)/\partial r]_{\epsilon} = 0$ .... (42) 

$[\partial(\Delta G)/\partial \epsilon]_{r} = 0$ .... (43) 



Let the Gibbs free energy for formation of $\beta$ be $\Delta G_v'$; it then follows 
that: 

$\Delta G_v' = \Delta S_v (T_m' - T) = \Delta S_v \Delta T'$ .... (44) 

$\sigma(\epsilon)$ is not known; however, we may calculate, from the measured 
temperature coefficient of $I$ and substituting (44) into (24), the value of 
$T_m'$ that will bring the experimental and theoretical values of $K_v$ into 
agreement. The value of $T_m'$ thus calculated is about 11-12°C below 
$T_m$. Displacements of the order of 1-2 per cent in the lattice parameter 
of $\alpha$ give a structure that should melt 11-12° below $T_m$. 

The nucleation frequency, $k_D$, of droplets coated with HgAc was 
proportional to the area rather than to the volume of the droplet. 
This result is in good agreement with the notion that the formation of 
nuclei in these droplets was catalyzed by the surface film. The 
temperature dependence of $I_s$ is satisfactorily described by equation (32) 
with the values of $\theta$ and $K_s$ given in Table 3. The experimental value 
of $K_s$ is in excellent agreement with the value predicted by the theory 
so $T_m'$ is displaced from $T_m$ by a negligible amount. 

TABLE 3 

Summary of Data on Isothermal Solidification of Small Droplets 

                       $\sigma$                  Log $K_s$          Log $K_v$ 
Metal     Film         ergs/cm$^2$   $\theta$    Exp'l   Theor.    Exp'l   Theor.   $T_m - T_m'$   $\Delta T$ 

Mercury   HgLau        31.2         -      -       -         42.1    35.1     12           79 
Mercury   HgAc         31.2         72°    26.8    27.3      -       -        -            46 
Mercury   HgSt         31.2         87°    27.4    27.3      -       -        -            60 
Mercury   Hg$_2$I$_2$  -            -      -       -         69.6    35.5     31           48 
Tin       Tin oxide    -            -      30.0    28.0      -       -        6            120 


Other results of the investigation of mercury solidification are 
summarized in the table. Solidification isotherms for droplets coated 
with Hg$_2$I$_2$ were described on the basis that nucleation was catalyzed 
by particles, characterized by a single $\theta$, suspended in the mercury and 
that either the mercury nucleus derives from a structure melting about 
30° below $T_m$ or that the rate controlling step is not nucleation of 
mercury crystals but the nucleation of a phase, possibly of Hg$_2$I$_2$, that 
catalyzes the nucleation of mercury. 

Results for solidification of the HgSt and HgX coated droplets 
could not be described by a single value of $I$ or $I_s$ at constant 
temperature but could be described by a multiplicity (2 for HgSt; a large 
number for HgX) of $I_s$ values. POUND and LA MER'S 49 isotherms for the 
solidification of oxide-coated tin droplets can be described by a similar 
multiplicity in $I_s$. Despite this multiplicity in $I_s$ the estimated $K_s$ 
values for HgSt-coated mercury and oxide-coated tin droplets are 
in excellent agreement with the theoretical values as indicated in 
the table. As a corollary $T_m'$ is displaced from $T_m$ by a very small 
amount in comparison with the supercooling. Because of the great 
multiplicity in $I_s$ it was not possible to estimate $K_s$ for HgX-coated 
droplets. 

In summary, the data on the solidification of supercooled mercury 
and tin droplets are compatible, to well within the experimental uncer- 
tainty, with the theory of homogeneous and heterogeneous nucleation 
in condensed systems. 

A valuable feature of the theory of homogeneous nucleation in 
condensed systems is that $K_v$ is calculable from the characteristic 
constants for the system. 26 It is thus possible on the basis of the theory 
to calculate the unknown parameter $\sigma$ from an $I$ value for a single 
temperature. We have confidence in the validity of this procedure 
because of the fair agreement between the theoretical and experimental 
$K_v$ for mercury. TURNBULL and CECH 50 have used a microscopic 
technique to study the solidification behaviour of small droplets (10-50 
micron diameter) of a variety of metals. The maximum supercooling 
$\Delta T_m$ before solidification that they obtained for the various metals is 
given in Table 4. Also listed in the table are $\Delta T_m$ values for other 
substances obtained by other investigators using a variety of 
techniques.* It is supposed that the magnitude of the supercooling $\Delta T'$ 
corresponding to a perceptible rate of homogeneous nucleation is equal 
to or greater than $\Delta T_m$. Assuming $\Delta T' = \Delta T_m$, 43 reasonable values of $I$ 
were assumed and $\sigma$ for the various substances was calculated from 
equation (24). These calculations have been brought up to date and the 
resulting $\sigma$ values are listed in Table 4. 
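
The procedure of calculating $\sigma$ from a single nucleation frequency can 
be sketched as follows. Equation (24) is not reproduced in this section, 
so the sketch assumes the usual form 
$I = K_v \exp[-16\pi\sigma^3/(3kT\,\Delta G_v^2)]$ with $\Delta G_v = \Delta S_v \Delta T$. The numerical 
inputs for mercury (heat of fusion, gram-atomic volume, $K_v$ and $I$) are 
illustrative assumptions and not values taken from the text, and the 
function name is hypothetical. 

```python
import math

K_BOLTZMANN = 1.380649e-16  # erg/K


def interfacial_energy(rate, k_v, t_melt, supercooling, heat_fusion,
                       molar_volume):
    """Return sigma (erg/cm^2) for spherical nuclei.

    Assumed rate law: I = K_v * exp(-16 pi sigma^3 / (3 k T dG_v^2)),
    with dG_v = (heat_fusion / t_melt) * supercooling / molar_volume.
    heat_fusion is in erg per gram atom, molar_volume in cc per gram atom.
    """
    temperature = t_melt - supercooling
    delta_g_v = heat_fusion * supercooling / (t_melt * molar_volume)
    sigma_cubed = (3.0 * K_BOLTZMANN * temperature * delta_g_v**2
                   * math.log(k_v / rate) / (16.0 * math.pi))
    return sigma_cubed ** (1.0 / 3.0)


if __name__ == "__main__":
    # All inputs below are assumed, order-of-magnitude values for mercury.
    sigma = interfacial_energy(rate=1.0e6, k_v=1.0e35, t_melt=234.3,
                               supercooling=77.0, heat_fusion=2.29e10,
                               molar_volume=14.1)
    print(round(sigma, 1))
```

With these assumed inputs $\sigma$ comes out close to 29 ergs/cm$^2$, of the same 
order as the mercury entries in Tables 3 and 4. Because the rate law 
contains $\sigma^3$, an uncertainty of several powers of ten in $I$ or $K_v$ 
changes the calculated $\sigma$ by only a few per cent. 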

* Recently there has been some dispute on the correct $\Delta T_m$ for water. SCHAEFER 39 
and CWILONG 38 agree that rapid formation of snow crystals begins when a supercooled 
fog reaches a temperature of -39°. However, CWILONG, 38 SANDER and DAMKOHLER, 22 
and POUND and MADONNA 28 all report that a fog formed at -50 to -62°C by rapid 
expansion of water vapour may consist of supercooled water droplets. Their conclusion 
that the droplets were liquid was based on their failure to observe reflections from 
crystal facets that are expected from snow crystals. SCHAEFER 54 has shown, however, 
that a dispersion of small ice spheres can exist without any perceptible reflections. 
We have tentatively accepted SCHAEFER'S value of $\Delta T_m$. 

TABLE 4 

Summary of Liquid-crystal Interfacial Energies Calculated 
from Supercooling Data 

Substance      Reference    $\Delta T_m$    $\sigma$ (ergs/cm$^2$)    $\sigma_g/T_m$ (cal/deg) 

Mercury        (28)         77        28.1*             1.52 
Gallium        (51)         76        56.0              1.91 
Tin            (49)         118       59.0              1.59 
Bismuth        (28)         90        54.4              1.52 
Lead           (28)         80        33.3              0.80 
Antimony       (50)         135       101.0             1.59 
Aluminium      (50)         130†      93.0              1.00 
Germanium      (50)         227       181.0             1.71 
Silver         (50)         227       126.0             1.00 
Gold           (50)         230       132.0             0.99 
Copper         (50)         236       177.0             1.01 
Manganese      (50)         308       206.0             1.11 
Nickel         (50)         319       255.0             1.08 
Cobalt         (50)         330       234.0             1.02 
Iron           (50)         295       204.0             0.88 
Palladium      (50)         332       209.0             1.01 
Platinum       (36)         370       240.0             1.05 
Water          (52)         39        32.1              1.69 
Benzene        (52)         48        17.2              2.58 

* Calculated on basis outlined in reference 43. 

† Recently FALKENHAGEN and HOFMANN 53 have, using very large cooling rates, 
supercooled aluminium about 195°. This value of $\Delta T_m$ gives $\sigma_g/\Delta H_f \approx 0.48$, in 
better agreement with the other values of $\sigma_g/\Delta H_f$ for face-centred cubic metals. 

The gram-atomic interfacial energy, $\sigma_g$, between liquid and crystal 
phases is defined as the free energy of a crystal-liquid interface 
containing one Avogadro number $N$ of atoms. Assuming that the 
interfacial energy is localized in a layer one atom thick it follows 43: 

$\sigma_g = N^{1/3} V^{2/3} \sigma$, .... (45) 

where $V$ is the gram-atomic volume of the crystalline phase. 
TURNBULL 43 showed that $\sigma_g$, calculated from nucleation rates, is roughly 
proportional to the gram-atomic heat of fusion $\Delta H_f$. $\sigma_g/\Delta H_f$ (see 
Table 4) ranges from 0.32 to 0.61. $\sigma_g$ for the various substances is 
plotted against $\Delta H_f$ in Fig. 9. Points for most of the metals fall fairly 
well on a common line of slope 0.46. 
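
As a worked illustration of equation (45), the short Python sketch below 
converts an interfacial energy per area into a gram-atomic value in 
calories. The gram-atomic volume used for mercury (14.1 cc) is an 
assumed input that does not appear in the text, and the function name 
is hypothetical. 

```python
N_AVOGADRO = 6.02214076e23  # atoms per gram atom
ERG_PER_CAL = 4.184e7


def gram_atomic_interfacial_energy(sigma, molar_volume):
    """Equation (45): sigma_g = N^(1/3) * V^(2/3) * sigma, returned in cal.

    sigma        -- interfacial energy, erg/cm^2
    molar_volume -- gram-atomic volume of the crystal, cc (assumed input)
    """
    sigma_g_erg = (N_AVOGADRO ** (1.0 / 3.0)
                   * molar_volume ** (2.0 / 3.0) * sigma)
    return sigma_g_erg / ERG_PER_CAL


if __name__ == "__main__":
    # Mercury: sigma from Table 4; V = 14.1 cc is an assumed value.
    print(round(gram_atomic_interfacial_energy(28.1, 14.1)))
```

For these inputs $\sigma_g$ is about 330 cal per gram atom. The result 
depends on the value adopted for $V$, so it should be read as an order of 
magnitude check and not as a reproduction of the tabulated ratios. 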

Large liquid masses of metals that crystallize to close-packed 
structures such as gold, copper or lead rarely supercool more than 5° 
before beginning to crystallize. For example, DANILOV and NEUMARK, 
after subjecting lead to what they believed to be a satisfactory 
purification procedure, found that it started to crystallize at 3° 
supercooling. On the other hand, large liquid masses of metals that 
crystallize to more complex non-close-packed structures, such as tin, 
bismuth or gallium, often can be easily supercooled 25° or more before 
starting to crystallize. These results led to the concept advanced by 
DANILOV and NEUMARK 40 and others that the supercooling 
corresponding to a perceptible nucleation rate, $\Delta T_m$, is strongly 
dependent on the structure of the forming crystal and that for liquids 
crystallizing to close-packed structures $\Delta T_m < 5°$. DANILOV and 
NEUMARK, for example, believed that $\Delta T_m \sim 3°$ for lead. This 
interpretation was considered plausible on the basis of the widely held 
view that the arrangement of atoms in most liquid metals approximates 
a close-packed structure. Thus the transition liquid (semi-close 
packed) -> crystal (close packed) could presumably take place with a 
minimum of atomic displacements. These views of the liquid -> crystal 
(close packed) transformation apparently were developed in ignorance 
of the results of INGERSOLL and MENDENHALL 36 and other earlier 
investigations of the supercooling of gold, etc. Certainly these results 
coupled with results summarized in Table 4 establish that $\Delta T_m/T_m$ 
ratios are not widely different for metals whether or not they crystallize 
to a close-packed structure. Also BARDENHEUER and BLECKMANN 55 
have been able, under certain conditions, to supercool a 150 g mass 
of liquid iron 258° before the beginning of crystallization. 

Fig. 9. Gram-atomic interfacial energy, $\sigma_g$, as a function of the heat 
of fusion, $\Delta H_f$. Revision of TURNBULL 43 

Although the nature of the forming structure has not a marked effect 
upon $\Delta T_m/T_m$ in solidification there is a rough correlation between the 
magnitude of the ratio $\sigma_g/T_m$ (see Table 4) and the structure 
complexity. 43 This ratio ranges from a value of 1.9 for gallium to about 
1.0 for metals that crystallize to close-packed structures. 

A substantial proportion of accidental catalyst particles are likely 
to be cubic or hexagonal in structure and may, therefore, be capable to a 
greater or lesser degree of catalyzing the formation of cubic or hexagonal 
crystals. 43 On the other hand, very few substances have a structure 
very similar to that of gallium and, therefore, capable of effectively 
catalyzing the formation of gallium crystals. In a random selection of 
catalyst particles that might be present accidentally in a liquid metal 
it is probable that a large proportion would have cubic or hexagonal 
structures but few, if any, would have a structure similar to that of 
gallium. Therefore, a liquid that crystallizes to a complex structure 
will probably supercool much more relative to its melting point than a 
liquid that crystallizes to a cubic or hexagonal structure. 

Although WEBSTER 56 and others found a pronounced thermal history 
effect in the solidification of large masses of bismuth, no measurable 
thermal history dependence of supercooling in the solidification of small 
bismuth droplets has been found. 51 These results show clearly that the 
thermal history effect depends for its existence upon the presence of 
crystal nucleation catalyst particles in the liquid. 

Crystal -> Liquid Transitions The frequency of homogeneous nuclea- 
tion of liquid droplets in the body of a crystal should be given by an 
equation identical in form with equation (24). This theory predicts that 
the homogeneous formation of liquid nuclei should not become percep- 
tible until the superheating becomes almost as great as the supercooling 
required for a perceptible rate of homogeneous crystal nucleation in the 
liquid. 

Experience indicates that crystals almost invariably begin to melt 
at temperatures not perceptibly different from the equilibrium 
temperature. Also it is known that most crystals are completely wet by 
liquid phase of the same composition at the melting temperature, 
hence: 

$\sigma_{CV} > \sigma_{LV} + \sigma_{LC}$ .... (46) 

where the subscripts C, L and V denote crystal, liquid and vapour. 
Therefore, there can be no net increase in surface energy when a liquid 
forms on a free solid surface of the same composition and $\Delta G^*$ for 
forming liquid nuclei on this surface is zero. For this reason crystals 
will not superheat when they have any part of their surface exposed to a 
gas phase. 

Thus it is only possible to superheat solids by preventing in some 
way the exposure to vapour of the solid surface at the equilibrium 
temperature. KHAIKIN and BENET 57 have succeeded in superheating 
the interior of a tin single crystal 1 to 2°. In their experiment a large 
electric current was passed through the crystal. The energy was carried 
away from the free surface of the crystal so that a temperature 
difference of 1-2° existed between the crystal axis and the surface. 
Although polycrystals and defective single crystals invariably started to 
melt internally, sound single crystals always started to melt from the 
free surface, indicating that the material at the crystal axis was 
superheated 1-2°. This amount of superheating is only a very small 
fraction of what is possible theoretically (~ 100° for tin). 

An experimental approach to the problem that has not yet been 
successfully worked out is to completely coat small crystals with a sub- 
stance that strongly catalyzes their formation in supercooled liquids. 
For example, it should be possible to substantially superheat tiny ice 
crystals covered with a coherent coating of silver iodide. 

STRUCTURE AND COMPOSITION 

Among the common types of transformation involving a change in 
structure and composition are: (1) condensation of liquid from a 
gaseous solution ; (2) the reverse process of formation of gas bubbles in 
liquid solution; (3) precipitation of crystalline phases from liquid 
solutions ; (4) precipitation of crystalline phases from solid solutions. 
We shall ignore the first two types and treat the last two, which are of 
most interest in metal systems. 

In the nucleation of crystals from solid or liquid solutions the 
important variables are structure, the strains due to the disregistry of 
the structures, composition and temperature. In our first approxima- 
tion we will neglect effects due to strain. 

Thermodynamics of Precipitation 

The Gibbs free energy, $G$, of a solution is a function of composition $x$ 
(mole fraction of one component), temperature $T$ and pressure $P$. 
Fig. 10 (a) is a schematic representation of $G$ as a function of $x$ (mole 
fraction of B) at constant $T$ and $P$. Here, $G$ is a continuous function of 
$x$ and applies to a series of solid solutions of one crystal symmetry (e.g. 
face-centred cubic) or liquid solutions. Fig. 11 shows two $G = f(x)$ 
relations, one for each of the phases $\alpha$ and $\beta$ (either two solid solutions 
or a liquid and a solid solution). The most convenient criterion of 
equilibrium in these systems is that $G$ be a minimum at constant $T$ and 
$P$. Applying this criterion to the system represented in Fig. 10 (a) it 
follows that the composition of the solution ranges continuously for 
$x < x_1$ and for $x > x_2$. For all $x_1 < x < x_2$, the free energy of the system 
is minimized by the coexistence of two phases of composition $x_1$ and $x_2$ 
[a and b are points of tangency of the line ab to $G = f(x)$]. Similarly it 
is found that the free energy of the system represented in Fig. 11 is a 
minimum for $x_\alpha < x < x_\beta$ when two phases of composition $x_\alpha$ and $x_\beta$ 
coexist. 

Fig. 10. (a) Free energy of a solid solution as a function of composition 
(b) Temperature composition equilibrium diagram for two components 
partially miscible in the solid state 

In the conventional representation of phase equilibrium the phase 
coexistence relations are projected on a $T$-$x$ plane for constant $P$ as 
shown in Fig. 10 (b). Also plotted as dotted lines (hd and he) in Fig. 
10 (b) are the spinodal relations. The points on these curves† fulfil the 
condition $[\partial^2 G/\partial x^2]_{T,P} = 0$. In areas hda and heb $[\partial^2 G/\partial x^2]_{T,P} > 0$ 
and in area hde $[\partial^2 G/\partial x^2]_{T,P} < 0$. 

Now consider fluctuations in composition in metastable solid solutions 
for $x_1 < x < x_2$. A cluster of atoms formed by a fluctuation may, for a 
given mean composition of the supersaturated solution $x$, be 
characterized by the number of atoms, $i$, that it contains and $\Delta x$, where 
$\Delta x = x' - x$ and $x'$ = the composition of the cluster. It is assumed, 
following BECKER, 25 and BORELIUS, 58 that the composition of the 
matrix immediately surrounding the cluster is $x$. In our discussion we 
shall consider only fluctuations favourable to the nucleation of $x_2$. 





Fig. 11. Free energy composition dependence at constant temperature 
of two solid solution structures, $\alpha$ and $\beta$ 

Fig. 12. Free energy of a cluster as a function of the fluctuation in 
composition, $\Delta x$, at constant size. After BORELIUS 58 



Let $\Delta G$ be the free energy increase per gram atom, neglecting
interfacial energy, for the formation of a cluster. It follows from
thermodynamic principles that

$\Delta G = G' - (G + \Delta x\,\partial G/\partial x)$ .... (47)

where $G'$ is the free energy per gram atom of solution at mole fraction
$x'$, $G$ is the free energy per gram atom of solution at mole fraction $x$,
and $\partial G/\partial x$ is the slope of the $G = f(x)$ relation at $x$.
$\Delta G$ is the difference in $G$ co-ordinates at a given $x'$ of
$G = f(x)$ and the tangent to $G = f(x)$ at point $x$. For the matrix
composition corresponding to point c, $\Delta G$ is proportional to the
length of the line df at $x' = x_s$, the spinodal composition, and to minus
the length of the line bg at $x' = x_2$.
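The tangent construction of equation (47) can be sketched numerically. The example below assumes a symmetric regular-solution form for $G(x)$; that model, the interaction parameter `omega` and every function name are illustrative assumptions and are not taken from the treatment above.

```python
import math

R_GAS = 1.987  # gas constant in cal/(gram atom K)


def free_energy(x, omega, temp):
    """Assumed regular-solution G(x) per gram atom (hypothetical model)."""
    mixing = x * math.log(x) + (1.0 - x) * math.log(1.0 - x)
    return omega * x * (1.0 - x) + R_GAS * temp * mixing


def slope(x, omega, temp):
    """dG/dx for the assumed model."""
    return omega * (1.0 - 2.0 * x) + R_GAS * temp * math.log(x / (1.0 - x))


def curvature(x, omega, temp):
    """d2G/dx2 for the assumed model; it vanishes on the spinodal."""
    return -2.0 * omega + R_GAS * temp / (x * (1.0 - x))


def delta_g(x_cluster, x_matrix, omega, temp):
    """Equation (47): height of G(x') above the tangent drawn at x."""
    dx = x_cluster - x_matrix
    tangent = free_energy(x_matrix, omega, temp) + dx * slope(x_matrix, omega, temp)
    return free_energy(x_cluster, omega, temp) - tangent


def spinodal(omega, temp):
    """Roots of d2G/dx2 = 0, i.e. x(1 - x) = RT/(2*omega).

    Returns None when the assumed model has no miscibility gap at temp.
    """
    disc = 1.0 - 2.0 * R_GAS * temp / omega
    if disc <= 0.0:
        return None
    root = math.sqrt(disc)
    return (1.0 - root) / 2.0, (1.0 + root) / 2.0
```

With this model a small fluctuation raises the free energy wherever the curvature is positive and lowers it between the two spinodal compositions, which is the distinction the text draws between metastable and unstable solutions.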

Fig. 12 shows $\Delta G = f(\Delta x)$ for a given matrix composition $x$.
The function goes through a maximum at the spinodal composition $x_s$. For
$x' > x_s$ at constant $i$ the free energy of a cluster is decreased by
fluctuations increasing $\Delta x$ but for $x' < x_s$ the reverse is true.

Let $\Delta G_2$ be the free energy per gram atom for forming clusters with
$x' = x_2$, neglecting surface energy again. $\Delta G_2$ as a function of
matrix composition $x$ is represented schematically in Fig. 13.
$\Delta G_2 = f(x)$ goes through a minimum at the spinodal composition on
the $x_1$ side and a maximum at the spinodal composition on the $x_2$ side.

Nucleation of a New Phase 

$\Delta G_i$, the increase in free energy due to the formation of a cluster
containing $i$ atoms, is a function of $T$, $x$ and $\Delta x$. For a given
value of $T$ and $x$ we may write

$\Delta G_i = a\sigma i^{2/3} + (i/N)\Delta G$ .... (48)

where $\sigma$, the interfacial energy per area between the cluster and
matrix, is a function of $x$ and $\Delta x$, $a$ is a shape factor,
$\Delta G$ is given by equation (47), and $N$ is Avogadro's number. The
number, $n_i$, of clusters per volume in equilibrium with the matrix is

$n_i = n_0 \exp[-\Delta G_i/kT]$ .... (49)

where $n_0$ is a slowly varying function of $\Delta x$.

Fig. 13. Free energy per gram atom of a cluster having the equilibrium
composition of the precipitate from a solid solution

For a given value of $x$, $\Delta G_i$ can be plotted as a function of $i$
and $\Delta x$. In order to become a nucleus for the growth of phase of
composition $x_2$, a cluster must have attained a critical size and
composition (critical cluster) such that further increase of either $i$ or
$\Delta x$ will decrease $\Delta G_i$. From the surface
$\Delta G_i = f(i, \Delta x)$ the minimum, $\Delta G_i = \Delta G_i^*$, for
the formation of a critical cluster can be found.

The location of the saddle point can be found formally from the
following equation:

$\Delta G_i = a i^{2/3} f(\Delta x) + (i/N) F(\Delta x)$ .... (50)

obtained by substituting $\sigma = f(\Delta x)$ and $\Delta G = F(\Delta x)$
in equation (48), where $f(\Delta x)$ and $F(\Delta x)$ are functions of
temperature and matrix composition. By setting
$[\partial(\Delta G_i)/\partial i]_{\Delta x} = 0$, we find the height of
the surface ridge $\Delta G_i$ to be:

$\Delta G_i = A[f(\Delta x)]^3/[F(\Delta x)]^2$ .... (51)

where $A = 4a^3N^2/27$. The value of $\Delta x = \Delta x^*$ at the saddle
point is found by solving the equation

$3F(\Delta x) f'(\Delta x) - 2f(\Delta x) F'(\Delta x) = 0$, .... (52)

obtained by equating $d(\Delta G_i)/d(\Delta x)$ to zero. Substitution of
$\Delta x^*$ into (51) gives $\Delta G_i^*$. The practical solution for the
saddle point position from equations (51) and (52) is complicated by our
lack of knowledge of the interfacial energy and its dependence upon
composition. A simpler solution is obtained if, following BECKER, 25 it is
assumed that the position of the saddle point is always at $x_2$. Then
$\Delta G = \Delta G_2$ and $\sigma$ is a function of $x$ only,
$\sigma = f(x)$, so that

$\Delta G_i^* = A\sigma^3/(\Delta G_2)^2$. .... (53)

By a nearest neighbour approximation BECKER estimates
$\sigma = (E/v^{2/3})(x - x_2)^2$, where $v$ is the volume per atom of
precipitate, $E = E_{AB} - \tfrac{1}{2}(E_{AA} + E_{BB})$ and $E_{AA}$, etc.,
are the pairwise interactions of the atoms A and B making up the solution.
HOBSTETTER 59 allowed the composition of the nucleus to vary and derived a
formal expression for the rate of nucleation.
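The ridge height of equations (51) and (53) can be checked numerically. The sketch below assumes that equation (48) has the surface-plus-volume form $a\sigma i^{2/3} + (i/N)\Delta G$; the function names, the spherical shape factor and the numerical values are illustrative assumptions only.

```python
import math

AVOGADRO = 6.022e23  # atoms per gram atom


def cluster_free_energy(i, a, sigma, dg, n_avogadro=AVOGADRO):
    """Assumed form of equation (48): a*sigma*i^(2/3) + (i/N)*dG."""
    return a * sigma * i ** (2.0 / 3.0) + (i / n_avogadro) * dg


def critical_cluster(a, sigma, dg, n_avogadro=AVOGADRO):
    """Size and height of the maximum of equation (48) with respect to i.

    dg must be negative (supersaturated matrix). The height equals
    A*sigma^3/dG^2 with A = 4*a^3*N^2/27, as in equations (51) and (53).
    """
    if dg >= 0.0:
        raise ValueError("no critical cluster unless dG < 0")
    i_star = (-2.0 * a * sigma * n_avogadro / (3.0 * dg)) ** 3
    barrier = 4.0 * a ** 3 * n_avogadro ** 2 * sigma ** 3 / (27.0 * dg ** 2)
    return i_star, barrier


def becker_sigma(e_pair, v_atom, x, x2):
    """BECKER's nearest-neighbour estimate sigma = (E/v^(2/3))*(x - x2)^2."""
    return e_pair / v_atom ** (2.0 / 3.0) * (x - x2) ** 2


# Illustrative (assumed) CGS values for a spherical cluster.
v_atom = 1.2e-23  # cm^3 per atom
a_sphere = (36.0 * math.pi) ** (1.0 / 3.0) * v_atom ** (2.0 / 3.0)
i_star, barrier = critical_cluster(a_sphere, sigma=100.0, dg=-4.0e9)
```

Here `sigma` is in erg/cm$^2$ and `dg` in erg per gram atom, so `barrier` is in erg; dividing it by $kT$ gives the exponent that enters the cluster population of equation (49).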

From the theory of nucleation in condensed systems it follows that
the steady state frequency of homogeneous nucleation of precipitate in a
supersaturated solution is (see equation (24)):

$I = (x - x_1)\,n^*(a\sigma/9\pi kT)^{1/2}\,n(kT/h)\exp[-(\Delta G_i^* + \Delta G_A)/kT]$ .... (54)

where $\Delta G_i^*$ is given by equation (51) or (53), $\Delta G_A$ is the
free energy for diffusion of component B in the solid solution and the
other terms have the same significance as in equation (24).

We have assumed that $\sigma$ is isotropic or that the critical clusters
are bounded by crystallographically similar planes. It is easy to refine
the treatment to take into account the nucleation of clusters exhibiting
simple departures from spherical or cubical forms. For example, suppose
that, due to the anisotropy of the interfacial energy, the critical
clusters are disc-like. Let the radius of the disc be $r$ and its thickness
$t$, then:

$\Delta G_i = 2\pi r t\sigma_1 + 2\pi r^2\sigma_2 + \pi r^2 t\,\Delta G_2/V$ .... (55)

where $V$ is the volume per gram atom of precipitate and $\sigma_1$ and
$\sigma_2$ are the interfacial energies per area of the disc edge and face
respectively. It follows that the radius $r^*$ and thickness $t^*$ of the
critical cluster are:

$r^* = -2\sigma_1 V/\Delta G_2$ .... (56a)

$t^* = -4\sigma_2 V/\Delta G_2$ .... (56b)

whence it follows:

$\Delta G_i^* = 8\pi\sigma_1^2\sigma_2 V^2/(\Delta G_2)^2$ .... (57)
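The critical dimensions of a disc-like cluster can be verified with a short calculation. The sketch assumes that the disc free energy is the sum of an edge term, two face terms and a volume term, which is the form consistent with equations (56a) and (56b); the function names and the numerical values in the usage note are hypothetical.

```python
import math


def disc_free_energy(r, t, sigma_edge, sigma_face, dg2, v_molar):
    """Assumed disc form: edge term + two faces + volume term."""
    edge = 2.0 * math.pi * r * t * sigma_edge
    faces = 2.0 * math.pi * r ** 2 * sigma_face
    bulk = math.pi * r ** 2 * t * dg2 / v_molar
    return edge + faces + bulk


def critical_disc(sigma_edge, sigma_face, dg2, v_molar):
    """Critical radius (56a), thickness (56b) and the resulting barrier."""
    if dg2 >= 0.0:
        raise ValueError("dG2 must be negative for a critical disc to exist")
    r_star = -2.0 * sigma_edge * v_molar / dg2
    t_star = -4.0 * sigma_face * v_molar / dg2
    barrier = (8.0 * math.pi * sigma_edge ** 2 * sigma_face
               * v_molar ** 2 / dg2 ** 2)
    return r_star, t_star, barrier
```

For example, `critical_disc(200.0, 20.0, -4.0e9, 7.0)` (erg/cm$^2$, erg per gram atom, cm$^3$ per gram atom) gives a disc whose thickness-to-radius ratio is $2\sigma_2/\sigma_1$, so a low face energy produces a thin plate.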



In discussing precipitation from solid solution, KONOBEYEVSKI 60 and
BORELIUS 58 have ignored the interfacial energy between the cluster and
the matrix and set $\Delta G_i = (i/N)\Delta G$. Even though not explicitly
stated, these theories, if valid, must apply whether $\alpha$ and $\beta$
are liquid or solid phases. Recently this theory has been further
developed by BORELIUS. According to BORELIUS' theory, $\Delta G_i^*$ must
lie along the ridge of the $\Delta G_i = f(i, \Delta x)$ surface
corresponding to the spinodal composition $x_s$. However, the height of
this ridge is a minimum when $i = 1$. A literal application of the
KONOBEYEVSKI-BORELIUS hypothesis, therefore, leads to the prediction that
$\Delta G_i^*$ is virtually zero for any $T$ and $x$. In order to obtain a
partial description of the experimental data BORELIUS finds it necessary to
postulate that the critical cluster crosses the ridge at
$\Delta G_i^* = (i^*/N)(\Delta G)_{x_s}$, where $i^*$ is of the order of 100
to 200 atoms. It is presumed that a fluctuation must include 100 to 200
atoms in order to be significant but there is no theory, based on the
KONOBEYEVSKI-BORELIUS postulate, that permits any prediction of the
magnitude of $i^*$.

BORELIUS has analyzed certain data on rates of precipitation and shown
that there appear to be certain singularities associated with
precipitation from supersaturated solutions initially having the spinodal
composition. When $\log \tau$, where $\tau$ is the time required for
half-completion of precipitation, is plotted against $1/T$ the curves for
various values of $x$, the initial matrix composition, are parallel at
relatively low temperatures. That is, the slopes of the curves are a
function of $T$ only. It is found that the temperature $T'$ at which the
slope becomes dependent upon $x$ as well as $T$ corresponds to the
temperature at which $x = x_s$, the spinodal composition. BORELIUS writes
for $\tau$,

$\tau = f_1(\Delta G_i^*)\,f_2(x)\,f_3(T)$ .... (58)

where $f_2(x)$ and $f_3(T)$ are factors that describe the rate of growth of
the precipitate. Since $\Delta G_i^* = f(x, T)$ and at $T < T'$,
$\Delta G_i^* = 0$, it follows from (58) that $d\tau/dT$ varies with $x$ for
$T > T'$ and is independent of $x$ for $T < T'$. This agreement with
experience constitutes the main evidence for the KONOBEYEVSKI-BORELIUS
theory. There are, however, certain facts that weaken the significance of
the agreement. BORELIUS' development is intended to describe the rate of
nucleation at some specified matrix composition. However, in testing his
theory he assumed that $1/\tau$ is proportional to the nucleation frequency
at zero time. Since $\tau$ must be a resultant of the rate of nucleation and
growth of precipitate, neither of which was measured independently in the
experiments cited, the validity of this assumption is doubtful. It should
also be realized that in considering the precipitation from solid
solutions strain energy induced by the transformation must be taken into
account.

There are some other important characteristics of spinodal solutions 
that might cause singularities in their behaviour. Most noteworthy is 
that the driving free energy, $\Delta G_2$, for the nucleation and growth of
the precipitate of equilibrium composition, is a maximum (see Fig. 13) at
the spinodal composition. Therefore, it may be possible to explain a
singularity in the kinetic behaviour at the spinodal temperature (for a
given $x$) in terms of rates of precipitate growth or nucleation of
clusters of composition $x_2$ without recourse to the KONOBEYEVSKI-BORELIUS
theory. 

Comparison of Theory and Experience 

In order to compare the theory with observation several conditions 
must be fulfilled. It must be assured that the nucleation of the new 
phases occurs homogeneously, and the strain energy induced by the 
transformation is negligible. The frequency of nucleation must be 
measured directly. Even more importantly for a quantitative comparison,
the dependence of free energy, $\Delta G$, and the interfacial free energy,
$\sigma$, on the composition must be known. The former dependence can be
measured by determining the energetics of the phases when both stable
and metastable, or by studying the energetics of the stable phases and
obtaining the difference of free energy by extrapolation. In principle,
the dependence of the interfacial energy upon composition can be
measured, but unfortunately, experimental difficulties have apparently
militated against its successful determination in any case.

While no direct measurements of the frequency of nucleation have been made
for the formation of a metallic solid solution from either a liquid or a
solid solution, the temperature at which the nucleation rate becomes
sensible has been determined for solidification in several binary
systems. 61, 62 As indicated previously, since the rate of nucleation from
fluid liquids is so temperature dependent, a nearly critical temperature of
nucleation of the solid will be observed. In measuring the critical
temperature for two series of liquid solutions the small particle technique
already described was applied. Results were obtained in the case of the
Cu-Ni system 61 in which there is complete miscibility in both the liquid
and the solid states, and for the Pb-Sn system 62 in which there is
miscibility in the liquid but immiscibility in the solid state. The latter
is a simple eutectic system. The variation with composition of the
temperature at which the nucleation frequency became sensible is given for
both these systems in Figs. 14 and 15. In the case of the Cu-Ni system the
critical temperature of nucleation nearly parallels both liquidus
temperatures and is continuous from the copper to the nickel side of the
phase diagram. The maximum supercooling is of the order of 20 per cent of
the absolute melting temperature and appears to occur by essentially the
same process in the alloys as in the pure elements copper and nickel. In
the case of the Pb-Sn system the critical temperature of nucleation
increases with the first small addition of tin to the lead as well as with
the first small addition of lead to tin. Thus, for all compositions the
magnitude of the supercooling is less than for the pure elements. These
results indicate that the interfacial energy between the pure solid and
liquid is decreased slightly by the first addition of both tin and lead;
otherwise the results are very similar to those obtained with pure metals.
However, since, as already indicated, the dependence of the interfacial
free energy on composition is not known for either of these systems, a
quantitative comparison of these experiments with theory is impossible.
Neither can the interfacial free energy be obtained from the nucleation
results as was the case for the solidification of pure metals. However, the
results are compatible with the general nucleation theory.

Fig. 14. Solidification temperatures of Cu-Ni alloys as a function of
composition. After CECH and TURNBULL 61

In considering precipitation from solid solution the strain energy induced
by the transformation generally must be taken into account, as indicated in
the next section. Although there are no direct measurements of the
frequency of nucleation during precipitation, many measurements have been
made of the overall rate of the process, $R$. This overall rate is a
composite of the rates of nucleation and growth of the precipitate. From
$R$, it is possible, with the aid of various assumptions, to deduce
something about the rate of nucleation. We have already noted that BORELIUS
assumed that the value of $R$ when precipitation is half-completed is
proportional to $I$ when precipitation has just begun. With the same
assumption HARDY 63 showed that the data of BORELIUS et al. 64 on
precipitation of tin from lead rich solutions can be described
qualitatively by the BECKER theory of nucleation. BECKER made similar
assumptions when comparing the results of JOHANSSON and HAGSTEN 65 on
precipitation from an Au-Pt alloy with the predictions of his theory.

Fig. 15. Initial (×) and final (△) solidification temperatures of lead rich
lead-tin alloy droplets as a function of their composition. Also
solidification temperatures (□) of tin rich lead-tin alloys as a function
of composition. After HOLLOMON and TURNBULL 62

A procedure far more satisfactory than using a single datum on the $R$ vs
$t$ curve is to determine the functional relation between $R$ and $t$
[$R = f(t)$] and find what assumptions about $I$ are compatible with it. An
example of this procedure is the analysis by WERT and ZENER 66 of WERT's 67
data on $R = f(t)$ at various temperatures for the precipitation of iron
carbide from $\alpha$-iron. Making reasonable assumptions about the growth
of the precipitate and taking impingement into account they were able to
describe the $R = f(t)$ relations on the assumption that all nuclei were
effectively in existence at zero time, indicating that nucleation was
heterogeneous. There are in most solids non-equilibrated structural
defects. These defects as well as accidental inclusions might serve as
preferred nucleation sites.

STRUCTURE AND STRAIN 
Incoherent Nucleation 

A change in the specific volume of the system accompanies most phase 
transformations. When transformation occurs within extensive gas or 
liquid phases of high fluidity the volume changes are accommodated by 
flow of the extensive phase. However, crystals and glasses can be 
subjected to a considerable stress without flowing at appreciable rates. 
It follows, therefore, that the volume change accompanying the forma- 
tion of an element of a new phase in a crystalline or glassy medium may 
have to be accommodated by strains set up in the medium and possibly 
the new phase. Therefore, it is necessary to consider the strain energy 
as part of the free energy of the formation of an embryo. This
transformation strain energy will be proportional to the number of atoms
contained in the embryo and equation (2) is modified to:

$\Delta G = a\sigma i^{2/3} + bi(\Delta G_v + E)$ .... (59)

where $E$ is the increase in strain energy of the system accompanying the
formation of a unit volume of an embryo. Evidently nucleation will not
occur unless $\Delta G_v + E < 0$.

A crystal usually begins to flow when a macrostress corresponding to an
elastic strain energy $E \sim 0.01$ cal/cc is applied. However, according
to the theory of plastic deformation perfect crystals should flow only if
$E > 100$ cal/cc. Real crystals flow at very low applied stresses only
because they contain non-equilibrated imperfections. These imperfections
(FRANK-READ 68 sources) generate dislocations that lead to plastic flow
upon the application of small stresses. It is believed that the
concentration of these imperfections in a well-annealed crystal is so small
($\sim 10^{12}$/cc) that there is a negligible probability that a nucleus,
which has a volume of the order of $10^{-20}$ cc, formed at random will
contain one. Therefore, in the vast majority of sites available for its
formation transformation stress energies of the order of 100 cal/cc can be
supported. However, the ability of elements of real crystals to support
such strains varies from element to element and it is expected that when
the transformation stresses are large nuclei will form preferentially in
elements of the crystal that do not, because of imperfections in their
vicinity, support large stresses. In general, Frank-Read sources will not
operate unless the stress is applied throughout an element of volume much
larger than $10^{-20}$ cc. These sources will, therefore, play an
unimportant role in relieving transformation stresses in nuclei formed in
their vicinity. However, stresses in very small volume elements in the
vicinity of grain boundaries and free surfaces probably can be partially
relieved by flow.* For these reasons we expect grain boundaries and free
surfaces to be preferred sites for nucleation in crystalline media.†

Since the dislocations in a real crystal are not in thermal equilibrium
$\Delta G_v = \Delta G_v'$ for volume elements transforming in the vicinity
of dislocations is not necessarily identical with $\Delta G_v$ for the
average volume element. We expect that there will be some dislocation
arrays for which $\Delta G_v > \Delta G_v'$. Further the magnitude of the
elastic coefficients in the neighbourhood of dislocations may be markedly
different from the mean elastic coefficients of the crystal. For these
reasons some arrays of dislocations in the body of the crystal will be
preferential sites for nucleation.

When a nucleus forms on a grain boundary some of the surface work required
for its formation is regained by destruction of the grain boundary which
has a free energy per area equal to $\sigma_g$. The free energy for forming
an embryo on a grain boundary is:

$\Delta G = (a\sigma - g\sigma_g)i^{2/3} + (\Delta G_v + E)bi$, .... (60)

where it is assumed that $\sigma$ is not dependent on crystal orientation
and $g$ is determined by the shape of the boundary region destroyed. For
example, if the embryo is spherical in form, equation (60) may be
rewritten:

$\Delta G = (4\sigma - \sigma_g)\pi r^2 + (\Delta G_v + E)\tfrac{4}{3}\pi r^3$ .... (61)
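The effect of the grain boundary term in equation (61) on the nucleation barrier can be sketched as follows. The equation is read here as $(4\sigma - \sigma_g)\pi r^2 + (\Delta G_v + E)\tfrac{4}{3}\pi r^3$; the function names and the numerical values in the usage note are illustrative assumptions.

```python
import math


def spherical_embryo_energy(r, sigma, sigma_g, dgv, strain):
    """Equation (61): surface term reduced by the destroyed boundary area."""
    surface = (4.0 * sigma - sigma_g) * math.pi * r ** 2
    volume = (dgv + strain) * (4.0 / 3.0) * math.pi * r ** 3
    return surface + volume


def critical_spherical_embryo(sigma, sigma_g, dgv, strain):
    """Radius and height of the maximum of equation (61) with respect to r."""
    drive = dgv + strain
    if drive >= 0.0:
        raise ValueError("nucleation requires dGv + E < 0")
    s_eff = 4.0 * sigma - sigma_g
    if s_eff <= 0.0:
        raise ValueError("no barrier when sigma_g >= 4*sigma")
    r_star = -s_eff / (2.0 * drive)
    barrier = math.pi * s_eff ** 3 / (12.0 * drive ** 2)
    return r_star, barrier
```

Setting `sigma_g = 0.0` recovers the familiar $16\pi\sigma^3/3(\Delta G_v + E)^2$ for a sphere in the body of the crystal. Because the barrier scales as $(4\sigma - \sigma_g)^3$, a boundary energy comparable with $\sigma$ lowers it appreciably, for instance `critical_spherical_embryo(100.0, 150.0, -2.0e9, 5.0e8)` against the same call with `sigma_g = 0.0`.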

We have noted that in liquid and gas media nucleation may be 
heterogeneous because nuclei form preferentially on the surface of 
accidental inclusions. While accidental inclusions are probably impor- 
tant catalysts for nucleation in crystalline media the factors sum- 
marized above probably also contribute to an important degree to the 
heterogeneity of nucleation in such media. On the basis of these con- 
siderations it is expected that free surfaces, grain boundaries, inclusions, 
and elements in the body of the crystals having a high stress energy will 
serve as preferred sites for the nucleation of phases in crystals. For
small $|\Delta G_v|$ it is probable that nucleation in crystals is entirely
heterogeneous.

However, we shall develop the theory for the nucleation of a crystal- 
line phase in a perfect crystal. The main features of this development can 
be adapted easily to a formal description of heterogeneous nucleation. 

NABARRO 69 has developed the theory for the relation between strain
energy due to precipitate particles and their shape and coherency. His
treatment will be adapted to the development of the formal theory of
nucleation for phase transformations involving no composition change.

* We are indebted to E. W. HART for advice on these points.

† There are additional reasons, to be discussed, why grain boundary sites
should be preferred in nucleation.

Consider the phase transformation $\alpha \to \beta$, where $\alpha$ and
$\beta$ are crystalline phases. Let $a_0$ be the lattice spacing of some low
index plane, $P_1$, in the $\beta$ lattice and $a_0 + \Delta a$ the lattice
spacing in a similar low index plane of the $\alpha$ lattice, then we
define the disregistry, $\delta$:

$\delta = |\Delta a/a_0|$ .... (62)

In general the $\beta$ nucleus may be strained an amount:

$\varepsilon = |(x - a_0)/a_0|$ .... (63)

where $x$ is the lattice parameter of the $P_1$ plane of the nucleus. When
$\varepsilon = \delta$ the nucleus is said to form coherently.

Suppose that the $\beta$ embryo is completely incoherent with $\alpha$
(i.e. $\varepsilon = 0$). The accompanying volume change is accommodated by
strain in the embryo and in the $\alpha$ matrix surrounding it, the strain
distribution depending on the elastic coefficients of the two structures.
However, following NABARRO, the strain energy of the system is a minimum
when incoherent $\beta$ has the form of a thin disc and all the strain is in
the surrounding $\alpha$.

Consider a hole in the $\alpha$ having the form of an ellipsoid with
semi-axes $a$, $a$, and $b$. NABARRO calculated the strain energy per
volume, $E$, required to expand the hole for the limiting forms thin disc,
sphere and needle and proposed the following general relation between
strain energy and shape:

$E = K|\Delta V|^{2/3} f(b/a)$, .... (64)

where $\Delta V$ is the difference in specific volume of the non-stressed
$\alpha$ and $\beta$ structures, $K$ is proportional to the rigidity modulus
of $\alpha$, $f(b/a) = 0$ for $b/a = 0$, and is a maximum for a sphere
($b/a = 1$).

Although the strain energy of an incoherent embryo is zero when it has the
form of an infinitely thin disc it will probably have the form of an oblate
spheroid so that the $\alpha$-$\beta$ interfacial energy will not be too
large. The free energy for forming the embryo is:

$\Delta G = \tfrac{4}{3}\pi a^3(1 - e^2)^{1/2}\{\Delta G_v + K|\Delta V|^{2/3} f[(1 - e^2)^{1/2}]\} + \pi a^2\sigma\{2 + [(1 - e^2)/e]\ln[(1 + e)/(1 - e)]\}$ .... (65)

where $e = (1 - b^2/a^2)^{1/2}$. The minimum free energy for forming a
$\beta$ nucleus is:

$\Delta G^* = f(a^*, e^*)$, .... (66)

where $a^*$ and $e^*$ simultaneously satisfy the equations:

$[\partial(\Delta G)/\partial a]_e = 0$ .... (67)

$[\partial(\Delta G)/\partial e]_a = 0$ .... (68)

Coherent and Semi-coherent Nucleation

Although the increase in strain energy due to the formation of incoherent
$\beta$ in the form of a thin disc is very small the surface energy may be
quite large. Now suppose $\beta$ forms so that a plane $P_1$ and
crystallographic direction, $d$, are parallel to a plane and direction of
similar atomic packing in the $\alpha$. It has been postulated that the
interfacial energy between two phases is a minimum when they are in perfect
registry ($\delta = \varepsilon$) across their boundary.

In general, $\delta > \varepsilon$ and the actual disregistry is equal to
$\delta - \varepsilon$. For $\delta - \varepsilon$ not too large (e.g.
$\delta - \varepsilon < 0.20$) the $\alpha$-$\beta$ interface across plane
$P_1$ can be pictured as consisting of small regions of relatively good fit
separated by an array of intersecting line dislocations.† The density of
dislocations in this array is $\delta - \varepsilon$. FRANK and VAN DER
MERWE 72 demonstrated that in a misfitting monolayer the energy due to
dislocations is directly proportional to their density, $\rho$. It has been
assumed 20 that $\rho \propto \delta - \varepsilon$ for the formation of a
$\beta$ nucleus on $\alpha$, so that $\sigma$ between $\alpha$ and $\beta$
may be written 20

$\sigma = \gamma + \eta(\delta - \varepsilon)$, .... (69)

where $\eta$ is a constant and $\gamma$ is a term due to bond type and
chemistry. It is apparent that $\sigma$ is a minimum when
$\delta = \varepsilon$.

In the case of semi-coherent precipitation the problem of minimizing 
the elastic energy of the system is somewhat more complicated than in 
the case of a completely incoherent particle. If the moduli of the 
precipitating particle are in all directions less than the moduli of the 
matrix material, then the precipitate will be plate-like so that the strain 
will reside primarily in the precipitate and the elastic energy minimized. 
In this case the elastic energy can be evaluated in the way indicated by 
NABARRO, 69 who has shown that $E$ for a disc shaped coherent precipitate
is of the order of (1/5) $E$ for a spherical precipitate. Applying these
arguments to the nucleation of coherent $\beta$ in $\alpha$ we expect that
$\beta$ embryos will probably again have the form of oblate spheroids and
the free energy required to form them is:

$\Delta G = \ldots (1 - e^2)^{1/2}\} + \ldots$ .... (70)

where $K'$ is proportional to the elastic coefficient for the coherent
formation of $\beta$. The minimum free energy for forming the nucleus is:

$\Delta G^* = f(a^*, e^*, \varepsilon^*)$ .... (71)

where $a^*$, $e^*$ and $\varepsilon^*$ simultaneously satisfy the equations

$[\partial(\Delta G)/\partial a]_{e,\varepsilon} = 0$, .... (72)

$[\partial(\Delta G)/\partial e]_{a,\varepsilon} = 0$, .... (73)

$[\partial(\Delta G)/\partial \varepsilon]_{a,e} = 0$. .... (74)

† VAN DER MERWE 70 and BROOKS have developed the description of interphase
boundaries in terms of a dislocation model. BROOKS has discussed in general
terms the application of the theory to nucleation in crystalline media.



In the case of a precipitate having all its elastic moduli larger than 
that of the matrix material, the shape of the precipitate which will 
minimize the elastic energy is that of a sphere. If, on the other hand, 
the modulus in one direction of the precipitate is less than the moduli of 
the matrix, the shape having minimum elastic energy will be rod-like. 
However, for embryos and nuclei, the anisotropy of the interfacial 
energy between particle and matrix will probably be more important in 
determining the shape of the particle than will considerations of elastic 
energy. The exact solution for the free energy of an embryo and a 
nucleus having varying elastic moduli can only be established if the 
dependence of the elastic modulus of matrix and precipitate on orienta- 
tion as well as the anisotropy of the surface energy are known. 

Theory of Catalysis of Crystal Nucleation 

In a preceding section a formal theory of nucleation catalysis was 
developed in which the catalytic potency was characterized by the 
contact angle $\theta$. While this theory proved to be satisfactory for
describing the rate of solidification of mercury and tin droplets coated with
various surface films, it gives little information on the mechanism of 
nucleation catalysis. 

On the basis of the postulate made in the preceding section that the 
interphase interfacial energy is a minimum when the phases are coherent 
across the boundary, a theory of nucleation catalysis has been derived. 20 
According to this theory the free energy of a crystallization nucleus 
formed on a catalyst surface is a minimum when the nucleus is at least 
partially strained in the direction of better atomic matching with the 
catalyst across the interface. 

We obtain the interfacial energy between the catalyst and the nucleus,
$\sigma_{SC}$, from equation (69). Now relating $\sigma_{SC}$ to the contact
angle through equation (15) we obtain:

$1 - m = \beta + \eta(\delta - \varepsilon)/\sigma_{LC}$, .... (75)

where $m = \cos\theta$ and

$\beta = 1 - (\sigma_{LS} - \gamma)/\sigma_{LC}$. .... (76)

The free energy of forming a nucleus having the shape of a spherical
sector of radius $r$ on a catalyst surface is

$\Delta G_s = V(\Delta G_v + c\varepsilon^2) + 2\pi r^2(1 - m)\sigma_{LC} + \pi r^2(1 - m^2)(\sigma_{SC} - \sigma_{LS})$, .... (77)

where $c$ is the appropriate coefficient of elasticity and $V$ is the volume
of the sector. $\Delta G_s^*$ for a given strain is:

$\Delta G_s^* = 4\pi\sigma_{LC}^3(2 + m)(1 - m)^2/[3(\Delta G_v + c\varepsilon^2)^2]$, .... (78)

and the value of $\varepsilon = \varepsilon^*$ that minimizes
$\Delta G_s^*$ is:

$\varepsilon^* = (\delta + \beta\sigma_{LC}/\eta) - [(\delta + \beta\sigma_{LC}/\eta)^2 - |\Delta G_v|/c]^{1/2}$. .... (79)

When the nucleation rate is perceptible, say 1 cm$^{-2}$ sec$^{-1}$,
$\Delta G_s^* \approx 60kT$ and $|\Delta G_v| = |\Delta G_v'|$. For
$c\delta^2 \gg |\Delta G_v'|$ and making the approximation $2 + m = 3$ we
obtain $|\Delta G_v'| = f(\delta)$ as follows:

$|\Delta G_v'| = (4\pi\sigma_{LC}/60kT)^{1/2}(\beta\sigma_{LC} + \eta\delta)$. .... (80)

If $|\Delta G_v'|$ is larger than $c\delta^2$ the nucleus forms coherently
and we have:

$|\Delta G_v'| = c\delta^2$. .... (81)

The approximate condition for coherent nucleation is found by equating
(80) and (81) to be:

$\delta < [\eta\mu + (\eta^2\mu^2 + 4\mu\beta\sigma_{LC}c)^{1/2}]/2c = \delta_c$, .... (82)

where $\mu = (4\pi\sigma_{LC}/60kT)^{1/2}$.

$|\Delta G_v'|$ is proportional, equation (30), to the supercooling
$\Delta T'$ at which nucleation is perceptible. For $\delta < \delta_c$,
$\Delta T'$ varies as $\delta^2$. When $\delta \gg \delta_c$ and for a given
value of $\beta$, $\Delta T'$ varies as $\delta$.
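The crossover between coherent and incoherent nucleation on a catalyst can be illustrated with a short calculation. The sketch reads equations (80) to (82) as $|\Delta G_v'| = \mu(\beta\sigma_{LC} + \eta\delta)$, $|\Delta G_v'| = c\delta^2$ and the positive root obtained by equating them, with $\mu = (4\pi\sigma_{LC}/60kT)^{1/2}$; the function names and all numerical values are assumptions chosen only for illustration.

```python
import math


def mu_factor(sigma_lc, kt):
    """mu = (4*pi*sigma_LC/60kT)^(1/2), from a barrier of about 60kT."""
    return math.sqrt(4.0 * math.pi * sigma_lc / (60.0 * kt))


def incoherent_driving_force(delta, beta, eta, sigma_lc, kt):
    """Equation (80): |dGv'| needed when the nucleus is essentially unstrained."""
    return mu_factor(sigma_lc, kt) * (beta * sigma_lc + eta * delta)


def coherent_driving_force(delta, c):
    """Equation (81): |dGv'| = c*delta^2 for a coherently strained nucleus."""
    return c * delta ** 2


def critical_disregistry(beta, eta, c, sigma_lc, kt):
    """Equation (82): the disregistry at which (80) and (81) coincide."""
    mu = mu_factor(sigma_lc, kt)
    root = math.sqrt((eta * mu) ** 2 + 4.0 * mu * beta * sigma_lc * c)
    return (eta * mu + root) / (2.0 * c)


# Illustrative (assumed) CGS values: erg/cm^2, erg/cm^3 and erg.
kt = 1.38e-16 * 273.0
delta_c = critical_disregistry(beta=0.0, eta=100.0, c=1.0e11,
                               sigma_lc=100.0, kt=kt)
```

Below `delta_c` the coherent requirement $c\delta^2$ is the smaller of the two, so the supercooling grows as $\delta^2$; above it the linear relation (80) is the smaller. With `beta = 0.0` the root collapses to $\mu\eta/c$.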

The potency, $p$, of a nucleation catalyst is proportional (see equation
(80)) to $1/(\beta\sigma_{LC} + \eta\delta)$. Although $p$ and $\delta$ are
reciprocally related the order of $p$ is not necessarily the same as the
order of $1/\delta$ because of $\beta$. If it is assumed that 20

$\sigma_{LC} = \sigma_{LS} - \gamma$, .... (83)†

$\beta = 0$ and a particularly simple theory of nucleation catalysis results
wherein the order of $p$ is identical with the order of $1/\delta$.

When equation (83) holds equations (79), (80) and (82) reduce to,
respectively:

$\varepsilon^* = \delta - (\delta^2 - |\Delta G_v|/c)^{1/2}$, .... (84)

$|\Delta G_v'| = (4\pi\sigma_{LC}/60kT)^{1/2}\eta\delta$, .... (85)

and

$\delta_c = (4\pi\sigma_{LC}/60kT)^{1/2}(\eta/c)$. .... (86)

There are a few data sufficiently quantitative to test the predictions
of the crystallographic theory of nucleation catalysis. When it is
assumed that ice nuclei form coherently on silver iodide
($\delta = 0.0145$) a value of $\Delta T' \approx 3.1$ is calculated 20 that
is in good agreement with the value $\Delta T' \approx 2.5$ measured by
VONNEGUT. Experience indicates 20 that nuclei probably do not form
coherently on catalyst surfaces for $\delta > 0.005$ to $0.015$. This result
is in rough agreement with the prediction of FRANK and VAN DER MERWE's 72
theory.

† The plausibility of equation (83) has been discussed elsewhere 20

It was pointed out 20 that most of the results on the kinetics of
heterogeneous nucleation of mercury and tin crystals can be derived on
the assumption that the nucleus is derived from a macroscopic structure
having a melting point little, if any, different from the melting point of
the stable structure. These results indicate that the nuclei formed
incoherently and with negligible strain. Because of the large
supercooling and the marked dissimilarity in structure between the droplet
surface films and the forming metal crystals it is not surprising that the
nuclei formed with $\varepsilon \approx 0$ in these experiments.

Theory of Martensite-type Transformations 

Theory. We have been tacitly assuming that transformation nuclei are
always characterized by the minimum possible free energy of formation.
Actually the net rate of transformation, $dx/dt$, of $\alpha$ to $\beta$
must be some function of the product of the rate of nucleation $I$ and the
rate of increase in volume of the nucleated region. The rate of increase in
volume of the nucleated region is in general proportional to $R^3$, where
$R$ is the linear growth rate. Therefore,

$dx/dt = f(IR^3)$. .... (87)

Fig. 16. Schematic representation of a martensite embryo in an $\alpha$
matrix

Now the transformation from $\alpha$ to $\beta$ will occur by a mode of
nucleation and growth that will make $dx/dt$ a maximum. Let the value of $I$
corresponding to the minimum $\Delta G^*$ be $I'$. The structure of the
nucleus corresponding to $I'$ is not necessarily identical with the
structure that leads to a maximum growth rate, $R_{max}$. We shall find that
the martensitic mode of transformation is brought about by nuclei
($S$-nuclei) that require a much higher free energy for their formation
than would nuclei of other possible configurations ($N$-nuclei).
Nevertheless, transformation is effected by $S$-type rather than $N$-type
nuclei because the rate of growth of $S$-regions is many orders of
magnitude greater than that of $N$-regions.

It was pointed out that for the coherent formation of $\beta$ in $\alpha$
the transformation strain energy is a minimum when $\beta$ has the form of a
thin disc and is strained in tension or compression to be coherent (normal
or $N$-type coherency) with $\alpha$ across a particular crystallographic
plane.

A different type of coherency results when the $\alpha$ matrix immediately
surrounding the $\beta$ is strained by shear to be coherent with $\beta$
(shear or $S$-type coherency). Fig. 16 shows schematically a $\beta$ region
surrounded by a region of $\alpha$ sheared to maintain coherency across the
interface. $E$ for a $\beta$-$S$ embryo is larger than $E$ for a
$\beta$-$N$ embryo. However, the atomic displacements required for the
growth of $S$-nuclei are in most transformations much less than those
required for the growth of $N$-nuclei. Therefore, $S$ grains once formed
grow at rates many orders of magnitude faster than $N$ grains, so that even
though the $N$-nuclei outnumber $S$-nuclei most of the transformation may
be effected by $S$-nuclei.

FISHER, HOLLOMON and TURNBULL⁷³ have developed a theory for the
energetics of formation of S-nuclei. They assumed that the embryos form
in the shape of lenticular discs of radius r and thickness t. The volume
of this disc is πr²t/2 and its area is 2πr². It is assumed that the
strain energy is localized in a sphere of radius r surrounding the
embryo. The strain at any point in this sphere is t/r when t ≪ r. Hence,

$$E = \tfrac{3}{4}\,c\,(t/r)^2 \tag{88}$$

where c is proportional to the shear modulus of α. The free energy of
formation of an S-embryo is then:

$$\Delta G = 2\pi r^2\sigma + \pi r^2 t\,\Delta G_v/2 + c\pi r t^2 \tag{89}$$

and

$$\Delta G^* = f(r^*, t^*) \tag{90}$$

where r* and t* simultaneously satisfy the equations

$$[\partial(\Delta G)/\partial r]_t = 0 \tag{91}$$

$$[\partial(\Delta G)/\partial t]_r = 0 \tag{92}$$

Carrying out the indicated operations we find:

$$\Delta G^* = \frac{8192\,\pi c^2\sigma^3}{27(\Delta G_v)^4} \tag{93}$$

$$t^* = -\frac{16\sigma}{3\,\Delta G_v} \tag{94}$$

$$t/r = t^*/r^* = -\frac{\Delta G_v}{4c} \tag{95}$$

S-embryos grow and shrink rapidly so that down to relatively low
temperatures α will contain an equilibrium distribution of β-S embryos
characteristic of the given temperature. Therefore, this theory predicts
that martensite transformation nuclei only form isothermally.
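
The closed forms (93) to (95) can be checked against the free energy expression (89). The sketch below is a minimal Python illustration, assuming equation (89) reads ΔG = 2πr²σ + πr²tΔG_v/2 + cπrt²; the function names are hypothetical, σ and c are the values quoted later in the text for iron alloys, and the driving force ΔG_v is an arbitrary illustrative number rather than measured data.

```python
import math


def embryo_free_energy(r, t, sigma, c, dG_v):
    """Equation (89): interface, volume and strain terms for a lenticular disc."""
    return (2.0 * math.pi * r**2 * sigma
            + math.pi * r**2 * t * dG_v / 2.0
            + c * math.pi * r * t**2)


def embryo_gradient(r, t, sigma, c, dG_v):
    """Partial derivatives of equation (89) with respect to r and to t."""
    d_dr = 4.0 * math.pi * r * sigma + math.pi * r * t * dG_v + c * math.pi * t**2
    d_dt = math.pi * r**2 * dG_v / 2.0 + 2.0 * c * math.pi * r * t
    return d_dr, d_dt


def critical_embryo(sigma, c, dG_v):
    """Closed forms (93) to (95); dG_v must be negative (a driving force)."""
    t_star = -16.0 * sigma / (3.0 * dG_v)                 # equation (94)
    r_star = t_star / (-dG_v / (4.0 * c))                 # from equation (95)
    dG_star = 8192.0 * math.pi * c**2 * sigma**3 / (27.0 * dG_v**4)  # (93)
    return r_star, t_star, dG_star


SIGMA = 24.0     # erg/cm^2, interfacial energy quoted in the text
C = 0.46e10      # dyn/cm^2, strain coefficient quoted in the text
DG_V = -1.0e9    # erg/cm^3, hypothetical driving force (assumption)

r_star, t_star, dG_star = critical_embryo(SIGMA, C, DG_V)
print("r* (cm):", r_star)
print("t* (cm):", t_star)
print("dG* (erg):", dG_star)
```

Evaluating `embryo_gradient` at (r*, t*) should return values that vanish to rounding error, and `embryo_free_energy` at the same point should reproduce the ΔG* of equation (93).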

We have pointed out that nucleation in crystals may be heterogeneous to
a marked degree because of non-equilibrated imperfections. That is, ΔG*
may vary from element to element in the α to give a spectrum of
preferred nucleation sites characterized by different values of ΔG* and
volume, v, of the element. Provided v is larger than the volume, v*, of
the critical nucleus for growth of β into α of mean free energy, the
nucleation frequency may be written:

$$I = \frac{K_v}{V}\sum_j V_j \exp[-\Delta G_j^*/kT] \tag{96}$$

where V is the volume of the specimen and V_j is the total untransformed
volume of elements characterized by ΔG_j*. Let I_j be the nucleation
frequency per volume corresponding to a particular ΔG_j* and for
simplicity assume that all the preferred sites have the same volume v
and yield only one nucleus. It then follows that there will be a
dependence of I = I_t on time at constant temperature given by:

$$I_t = (1/V)\sum_j I_j V_j^0 \exp[-I_j v t] \tag{97}$$

where V_j⁰ = V_j at t = 0. This generalization predicts that isothermal
martensite nuclei will form at preferred sites and that the nucleation
frequency will decrease with increasing time.
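
The decay of the nucleation frequency can be made concrete with a short calculation. The following is a sketch only, assuming equation (97) reads I_t = (1/V) Σ_j I_j V_j⁰ exp(−I_j v t), so that each class of sites is exhausted at the rate I_j v. The function name, the two site classes and all numerical values are hypothetical.

```python
import math


def nucleation_frequency(t, site_classes, specimen_volume, site_volume):
    """Assumed form of equation (97).

    site_classes is a list of (I_j, V_j0) pairs: the nucleation frequency
    per volume of class j and its untransformed volume at t = 0.
    """
    total = 0.0
    for I_j, V_j0 in site_classes:
        total += I_j * V_j0 * math.exp(-I_j * site_volume * t)
    return total / specimen_volume


# Hypothetical values, for illustration only.
V = 1.0                      # specimen volume, cm^3
v = 1.0e-15                  # volume of one preferred site, cm^3
CLASSES = [(1.0e14, 1.0e-6),  # potent sites, small total volume
           (1.0e12, 1.0e-4)]  # weaker sites, larger total volume

times = [0.0, 10.0, 100.0, 1000.0]   # seconds
frequencies = [nucleation_frequency(t, CLASSES, V, v) for t in times]
for t, I_t in zip(times, frequencies):
    print(t, I_t)
```

The most potent sites are consumed first, so the frequency falls quickly at short times and more slowly once only the weaker classes remain.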

Experience indicates that many martensite transformations are athermal;
that is, the amount of transformation appears to be independent of time
at temperature and is a function of temperature only. It is as though
transformation nuclei were in existence at all times and that a certain
number, n₀, of these become active for growth when the temperature is
lowered by a unit amount, n₀ increasing with decreasing temperature.

This athermal characteristic of some martensite reactions is explained
either by frozen-in composition fluctuations or by frozen-in variations
in free energy from point to point in the α crystal due to local
strains. Formally, the theory is very much the same from either point of
view. We shall first develop the theory for the formation of athermal
nuclei from composition fluctuations.

When α contains a major component A and a minor one B, where B has a
strong stabilizing effect on α relative to β, we may expect that β
embryos will form preferentially in regions that, because of composition
fluctuations, are depleted in B. Let Δg_i be the increase in free energy
associated with removing all i atoms of B from a region of α of average
composition. According to the development of FISHER, HOLLOMON and
TURNBULL⁷³ the number of these regions per volume, n_i, of pure A is:

$$n_i = n\exp[-\Delta g_i/kT] \tag{98}$$

where n is the number of lattice sites per volume. The number of these
regions, n_i*, exceeding a critical size corresponding to i ≥ i* is:

$$n_{i^*} = \int_{i=i^*}^{\infty} n_i\,di = \frac{n}{(1/i)(\Delta g_i/kT)}\exp[-\Delta g_{i^*}/kT] \tag{99}$$

where Δg_i = Δg_i* at i = i*.
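
The integral in equation (99) can be evaluated numerically as a check. The sketch below is a minimal Python illustration under an assumption that is not stated in the text: that Δg_i is proportional to i, Δg_i = i g, with g a free energy per B atom removed. With n_i = n exp(−Δg_i/kT) the integral from i* to infinity then equals n (kT/g) exp(−i* g/kT). All names and numerical values are hypothetical.

```python
import math

K_B = 1.380649e-16  # Boltzmann constant, erg/K


def regions_of_size(i, n, g, T):
    """n_i = n exp(-dg_i/kT), with dg_i = i*g assumed linear in i."""
    return n * math.exp(-i * g / (K_B * T))


def supercritical_closed_form(i_star, n, g, T):
    """Integral of n_i from i_star to infinity under the linear assumption."""
    return n * (K_B * T / g) * math.exp(-i_star * g / (K_B * T))


def supercritical_numeric(i_star, n, g, T, steps=20000):
    """Trapezoidal estimate of the same integral, cut off 40 kT/g above i_star."""
    upper = i_star + 40.0 * K_B * T / g
    h = (upper - i_star) / steps
    total = 0.5 * (regions_of_size(i_star, n, g, T)
                   + regions_of_size(upper, n, g, T))
    for k in range(1, steps):
        total += regions_of_size(i_star + k * h, n, g, T)
    return total * h


# Hypothetical values, for illustration only.
N_SITES = 8.5e22                         # lattice sites per cm^3
T_FREEZE = 1000.0                        # K, temperature of the frozen-in fluctuations
G_PER_ATOM = 0.05 * K_B * T_FREEZE       # erg per B atom removed
I_STAR = 200.0                           # critical size

closed = supercritical_closed_form(I_STAR, N_SITES, G_PER_ATOM, T_FREEZE)
numeric = supercritical_numeric(I_STAR, N_SITES, G_PER_ATOM, T_FREEZE)
print(closed, numeric)
```

Lowering i* while the distribution stays frozen increases the count, which is how a further quench activates additional nuclei in this picture.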
Now suppose that at a given temperature the β-S nucleus would contain i*
atoms of B if it had the same composition as the α matrix. ΔG_v may be
large enough so that the equilibrium concentration of S-nuclei,
n* = n exp[−ΔG*/kT], is negligible. However, the number of α regions,
n_i*, free of B and having dimensions equal to or larger than t* and r*
can be appreciable. Since the volume free energy loss for formation of β
in one of these B-poor regions (B⁻) is much greater than for its
formation in a region of uniform composition, the probability that β
will nucleate in a B⁻ region is many orders of magnitude larger than the
probability that it will nucleate in a region of uniform composition, B.

When a β-S nucleus forms in a B⁻ element it will immediately grow to the
boundaries of the element. Then if the dimensions of the B⁻ element
equal or exceed t* and r* the nucleus will continue to grow into the B
region, providing the interfacial energy between B and B⁻ is negligible
when the structure is the same on either side of the boundary.
Therefore, in an alloy of the type we are considering it is possible
that β is nucleated entirely by B⁻ elements formed by fluctuations.

Fig. 17. Distribution of embryos, depleted in a minor solution
constituent, with respect to size

At relatively low temperatures the composition fluctuations may be
frozen in and the number and size of B⁻ regions will be the equilibrium
number characteristic of a high temperature. We shall suppose that the
B⁻ elements are entirely transformed to β. A schematic representation of
the number of these B⁻ regions as a function of i (proportional to
volume) is given in Fig. 17. At any temperature there is a critical
volume, corresponding to i*, of B⁻ that will transform the surrounding α
of B composition to β. When a sample is quenched to any temperature T
the number of these supercritical regions, hence β nuclei, is given by
(99). A number, n_i*, will grow immediately and the transformation will
stop. When the temperature is further lowered i* is decreased and more β
nuclei are formed. Since these nuclei are brought into being not by
thermal composition fluctuations at temperature but by having the
potential barrier to nucleation lowered they are called athermal nuclei.
The total number that have formed upon quenching to temperature T₁ is
proportional to the area under the curve of Fig. 17 beyond i₁*. Upon
quenching to T₂ an additional number per untransformed volume,
proportional to the area under the curve between i₂* and i₁* in Fig. 17,
form.

When ΔG_v is sufficiently large so that the equilibrium concentration of
β nuclei in B regions is appreciable, nucleation takes place
isothermally and at a steady rate. For alloys the possibility exists
that β is nucleated by B⁻ regions at temperatures too high for the
formation of β nuclei in B regions. Nuclei formed in this way come into
being only when the temperature is lowered and the martensite formed
therefrom is athermal.

By considering the heterogeneous distribution of free energy expected in
a real crystal a qualitative theory for athermal nucleation can be
derived without invoking composition fluctuations. Suppose that (see
equation (96)) the volume of the preferred nucleation sites is of the
same order as v*, the volume that the β nuclei must exceed in order to
grow into the α matrix of mean free energy. Then at a preferred site
having a volume, v, β may nucleate, grow to the site boundary, and be
unable to grow further until the temperature is lowered sufficiently to
make v* < v. There will be some history-sensitive distribution of
preferred sites that will be described by a function n_v = f(v, ΔG_v),
where n_v is the number of sites characterized by volume v and free
energy ΔG_v. The number of athermal nuclei, n, formed at a given
temperature can be calculated from n_v = f(v, ΔG_v). n will be the
number of high energy regions having a volume exceeding πr*²t*/2, where
r* and t* are given by equations (94) and (95).

COHEN, MACHLIN and co-workers⁷⁴, ⁷⁵ have proposed an alternative theory
for the nucleation of martensite that ignores any contribution of
interfacial energy to the free energy of nucleus formation. Let the
angle through which α must be sheared to form β be θ_α→β. There will
exist in the α many small embryos that are only partially sheared to β;
that is, θ < θ_α→β. These β embryos must be characterized by their shear
angle θ and the number of atoms i that they contain.

Now according to the COHEN-MACHLIN theory, for a given value of i, the
free energy of the β embryo goes through a maximum at θ = θ_m as θ is
increased. We may imagine a three dimensional surface ΔG = f(i, θ) for
constant temperature. The problem is to find the saddle point in this
surface corresponding to the minimum free energy that the embryo must
have before it can grow to a large size with the free energy continually
decreasing.†

† It will be noted that the COHEN-MACHLIN theory of nucleation with i
and θ the variables is the structure-strain analogue of the BORELIUS
theory of nucleation of precipitate with i and Δx the variables.

In the ΔG = f(i, θ) surface there is a single ridge (shear ridge), the
maximum height of which is given by the relation ΔG_m = F(i), where
ΔG = ΔG_m at θ = θ_m. The value of i at the saddle point i = i* should
be found by solving the equation d(ΔG_m)/di = 0. The major difficulty of
the COHEN-MACHLIN theory is that the functions ΔG = f(i, θ) and
ΔG_m = F(i) are only qualitatively defined. We expect that the increase
in free energy of the embryo due to shearing through a given angle, e.g.
θ = θ_m, should increase continuously with i and that the critical
fluctuation, α → β, will, therefore, cross the shear ridge at a value
i → 0. Therefore, the COHEN-MACHLIN theory seems to require, as does the
BORELIUS theory, an additional relation that will permit the calculation
of the size of the region that undergoes the critical fluctuation.

COHEN, MACHLIN and co-workers cite the often-observed heterogeneous
nature of the martensite transformation as strong supporting evidence
for their theory. However, as we have seen, the surface barrier theory
also predicts a heterogeneous martensite reaction if ΔG varies from one
volume element of α to another due to non-equilibration.

Comparison of Theory and Experience 

Non-martensite Transformations. Although there have been extensive
measurements of the isothermal kinetics of various solid-solid
non-martensitic N-type transformations there is little quantitative
knowledge of the nucleation frequency for these transformations. It is
known that the supercooling required to initiate N-type transformations
in pure metals is usually very small; for example, of the order of 5-10°
in the γ → α iron transition. Orientation relations for these
transformations are usually consistent with the description that the low
index crystallographic planes and directions in the two structures
having the most similar atomic packing and registry are parallel. Nuclei
usually form preferentially in the vicinity of grain boundaries and free
surfaces. More quantitative, carefully controlled measurements of
nucleation frequency in N-type transformations are necessary in order to
test critically the predictions of the nucleation theory.

Martensite Transformations. The data on nucleation frequency for S-type
transformations are no more quantitative than for N-type but more
information of value for comparison with nucleation theories can be
inferred from the overall S-type transformation characteristics.

For many years it was believed that S transformations were
insuppressible by quenching and almost entirely athermal in nature.
Recently, however, KURDJUMOV and MAKSIMOVA⁷⁶ showed that the S
transformation in certain iron alloys could be suppressed entirely by
rapid quenching to liquid nitrogen temperature and upon subsequent
isothermal holding at this temperature and higher an S-type
transformation occurred that was entirely isothermal in character. As
much as 50 per cent of the γ-iron phase could transform to α by the
S-mechanism isothermally. The essential requirement for the isothermal
mode of transformation seemed to be that the alloy be characterized by
an M_s temperature of the order of 300K or less.

About the same time AVERBACH and COHEN⁷⁷, ⁷⁸ showed that there was a
small isothermal component of the S transformation in iron-1 per cent
carbon alloys. However, the amount of alloy transformed isothermally was
small relative to that transformed athermally. More recently DAS GUPTA
and LEMENT⁷⁹ found that after some athermal S transformation in a high
chromium steel a substantial amount of transformation took place
isothermally. They found that considerable athermal transformation had
occurred before the isothermal transformation began.

KULIN and COHEN⁸⁰ were unable to confirm KURDJUMOV and MAKSIMOVA'S
result that athermal martensite formation in a 6 per cent manganese,
0.6 per cent carbon steel could be completely suppressed by rapid
quenching. However, it is significant that KURDJUMOV and MAKSIMOVA (K-M)
report an M_s for this steel about 100° lower than that found by KULIN
and COHEN⁸⁰ and other investigators; it seems likely, therefore, that
K-M made an error in reporting the composition of the steel.

Recent results of CECH and HOLLOMON⁸¹ on the isothermal S transformation
of a 23 per cent Ni, 4 per cent Mn, 0.05 per cent carbon steel have
completely confirmed K-M's main conclusions. The transformation was
entirely suppressed, within the experimental uncertainty, upon rapid
quenching to −196°C. Upon holding at various temperatures between −196
and −80°C transformation occurred isothermally and as much as 50 per
cent of the specimen transformed by the isothermal mode. The results
were satisfactorily described by a conventional
time-temperature-transformation (T.T.T.) diagram (Fig. 18) with a nose
temperature at −126°C. Particular isotherms were described fairly well
by straight line relations between the fraction transformed and the
logarithm of the time. Metallographic examination disclosed that the
rate determining step in the transformation is nucleation and not
growth. K-M and DAS GUPTA and LEMENT⁷⁹ also found that nucleation is the
rate-determining step in the isothermal transformation.

FISHER⁸² established that CECH and HOLLOMON'S (C-H) isotherms can be
described on the assumption of a constant nucleation frequency per
untransformed volume. The rapid decrease in transformation rate with
time is due to a partitioning effect whereby martensite plates already
formed cut down the volume that can be transformed by nuclei formed at
later stages of the transformation. FISHER⁸² also demonstrated that the
C-H T.T.T. curves are described remarkably well by the
FISHER-HOLLOMON-TURNBULL (F-H-T) theory of thermal martensite formation
(equations (88) to (95)). Since the rate determining step is nucleation
the temperature T_m corresponding to the nose of the T.T.T. curve can be
calculated by substitution of equation (93) into (28). Upon performing
this operation using JONES and PUMPHREY'S⁸³ free energy data and
assuming ΔG_A = 0, FISHER calculates T_m = 147K, in excellent agreement
with the measured T_m = 147K. Further, with a single disposable
parameter (σ³c²) the entire T.T.T. curve was described from equations
(24) and (93) within experimental error. K_v (see equation (24)) was
estimated from the order of magnitude of I to be 10³⁶ sec⁻¹ cm⁻³, which
agrees very well with the value 10³⁷ sec⁻¹ cm⁻³ estimated from equation
(24). This agreement between theory and experience is certainly not
trivial; it constitutes powerful support for the F-H-T theory of
nucleation without composition fluctuations in S transformations.

Fig. 18. T.T.T. curves for formation of martensite in Fe-Ni-Mn-C alloy.
After CECH and HOLLOMON⁸¹

F-H-T have favoured the view that athermal martensite can be accounted
for only by frozen-in composition fluctuations. For example, in iron
base alloys the elements most effective in retarding the martensite
transformation are those that decrease the free energy driving
transformation. When the composition permits, fluctuations occur that
lead to the formation of an appreciable concentration of regions free of
alloying element and nucleation of α in these regions is much more
probable than in regions of uniform composition. When an alloy is
quenched to a particular temperature the total number, N, of martensite
plates that have formed per volume, when the fraction transformed is not
large, is:

$$N = It + n_{i^*} \tag{100}$$

where I is the nucleation frequency per unit volume for regions of
uniform composition {I = K_v exp[−ΔG*/kT], where ΔG* is given by
equation (93)} and n_i*, given by equation (99), is the number of plates
formed from nuclei produced by frozen-in composition fluctuations. The
first term of equation (100) is the thermal component and the second
term the athermal component of the nucleation. When the alloying content
is very large (e.g. 20-30 atom per cent) the probability of forming a
large region of pure parent metal is small, so that n_i* is small and
the thermal component, It, may predominate. However, if the alloying
content is quite small n_i* may become large before It becomes important
and most of the transformation will take place by the athermal mode. A
more quantitative theory of the relative magnitude of the thermal and
athermal components is desirable.

F-H-T have demonstrated that the variation in M_s temperature with
composition for the formation of athermal martensite in iron alloys
containing carbon and chromium⁸⁴, ⁸⁵ is described by the composition
fluctuation theory with the use of only one disposable parameter which
is σ, the α-γ interfacial energy per area. Also the data of HARRIS and
COHEN⁸⁵ on the amount of athermal martensite formed as a function of
temperature in a carbon-chromium steel were described within
experimental error with c (see equation (93)) the only disposable
parameter. It was shown that the values of σ (24 ergs per cm²) and c
(0.46 × 10¹⁰ dynes per cm²) chosen to give agreement between theory and
experiment are plausible.

It has not yet been established whether or not composition fluctuations
are a necessary condition for the athermal mode of martensite
nucleation. Athermal martensite is known to form in the low temperature
transition face-centred cubic → body-centred cubic in lithium⁸⁶ and in
the cobalt⁸⁷ (face-centred cubic → hexagonal close packed)
transformation. However, these substances were not of highest purity and
no systematic study on the effect of alloying elements on the kinetics
of the martensite transformation in these metals seems to have been
made. In order to find whether or not composition fluctuations are a
necessary concomitant of athermal martensite in these metals it will be
necessary to make such a study on the effect of alloying elements.

The amount of martensite that forms, either thermally or athermally, at
a given temperature T₁ may be materially decreased by decreasing the
rate at which the specimen is initially cooled to T₁. This phenomenon is
called stabilization. For example, CECH and HOLLOMON⁸¹ found that the
rate of isothermal transformation in the Fe-Ni-Mn alloy is substantially
reduced by long holding times at room temperature before quenching to
the transformation temperature. Further, if after the isothermal
transformation is underway at a temperature T₁ the sample is upquenched
to the ambient temperature and then quenched to T₁, the transformation
rate at T₁ is very much less (by an order of magnitude) than before the
upquenching.

COHEN, MACHLIN and PARANJPE⁷⁴ have suggested that stabilization results
from the lowering of the stress levels at preferential nucleation sites
in γ-iron by a recovery process. That is, stresses are developed in
certain volume elements of the γ, due to quenching for example, and α
nuclei probably prefer to form at these stressed sites. These stresses
may be partially relieved by recovery processes so that nucleation of α
is less probable; hence, stabilization occurs.

STRUCTURE, STRAIN AND COMPOSITION
Theory

An embryo for precipitation is described by its structure, composition,
size, shape and state of strain. The problem of finding the most
probable values of these variables for the critical cluster or nucleus
is formidably complex. The free energy of formation of such a cluster,
that is incoherent and has the composition of the equilibrium
precipitate, is given by equation (65), except that now ΔG will be a
function of x. For an oblate spheroidal form we have:

1/2 a 2 - 21 / 2 



AG, = W(l - e a ) 1/2 a + <J 2 /[(1 - e 2 ) 1 / 2 ] 



The free energy of the critical cluster ΔG_x* = f(e*, a*) is found in
the same way as before (see equations (67) and (68)).

For reasons similar to those already outlined in the section on
structure and strain, precipitation nuclei may prefer to form coherently
or semi-coherently in order to minimize their surface energy. The
expression for the free energy of formation of semi-coherent clusters is
analogous to equation (70), if the precipitate is softer elastically
than the matrix,



- e 2 ) l/2 ]} 

[d _ e 2\ 
2 + L__' i 



where K f is the elastic coefficient appropriate to the coherent straining 
of the cluster. The free energy of the critical cluster
ΔG_x* = f(a*, e*, ε*) is found by the procedure indicated in the earlier
equations, (72) through (74).

If δ is relatively small and γ is relatively large, clusters having a
nearly spherical shape will form. Following FISHER, HOLLOMON and
LESCHEN⁸⁸ the free energy of forming a partly coherent spherical cluster
is written:

$$\Delta G_r = [\gamma + \eta(\delta - \epsilon)]\,4\pi r^2 + (4/3)\pi r^3(\Delta G_v + c\epsilon^2) \tag{103}$$

where ΔG_v = ΔG_x/V and c is the appropriate elastic coefficient. It
follows that the free energy of formation of the critical cluster is:

$$\Delta G_r^* = \frac{16\pi[\gamma + \eta(\delta - \epsilon^*)]^3}{3(\Delta G_v + c\epsilon^{*2})^2} \tag{104}$$

where

$$\epsilon^* = 2(\delta + \gamma/\eta) - \{4(\delta + \gamma/\eta)^2 + 3\Delta G_v/c\}^{1/2}, \tag{105}$$

and the condition for coherency is:

$$|\Delta G_v| > c\delta^2\{1 + 4\gamma/3\eta\delta\} = Z. \tag{106}$$

When the supersaturation is very large |ΔG_v| > Z and the nucleus forms
coherently. When Z ≫ |ΔG_v|, δ ≫ ε.
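
The relations for the partly coherent spherical cluster can be explored numerically. The following is a minimal sketch, assuming that equation (103) reads ΔG_r = [γ + η(δ − ε)]4πr² + (4/3)πr³(ΔG_v + cε²), that the critical strain is the smaller root of ε² − 4(δ + γ/η)ε − 3ΔG_v/c = 0 (obtained by setting both partial derivatives to zero), and that the strain cannot exceed the misfit δ. The function names and every numerical value are hypothetical.

```python
import math


def coherency_threshold(c, delta, gamma, eta):
    """Z of equation (106): driving force needed for full coherency."""
    return c * delta**2 * (1.0 + 4.0 * gamma / (3.0 * eta * delta))


def critical_strain(dG_v, c, delta, gamma, eta):
    """Assumed form of equation (105), capped at delta (fully coherent)."""
    if -dG_v >= coherency_threshold(c, delta, gamma, eta):
        return delta
    a = delta + gamma / eta
    return 2.0 * a - math.sqrt(4.0 * a * a + 3.0 * dG_v / c)


def critical_cluster(dG_v, c, delta, gamma, eta):
    """Critical strain, radius and free energy from (103) and (104)."""
    eps = critical_strain(dG_v, c, delta, gamma, eta)
    surface = gamma + eta * (delta - eps)   # effective interfacial energy
    volume = dG_v + c * eps**2              # net volume free energy (negative)
    r_star = -2.0 * surface / volume
    dG_star = 16.0 * math.pi * surface**3 / (3.0 * volume**2)
    return eps, r_star, dG_star


# Hypothetical values in c.g.s. units, for illustration only.
C_EL = 1.0e12    # erg/cm^3, elastic coefficient
DELTA = 0.02     # misfit (disregistry)
GAMMA = 50.0     # erg/cm^2, interfacial energy of the coherent boundary
ETA = 5000.0     # erg/cm^2 per unit disregistry

Z = coherency_threshold(C_EL, DELTA, GAMMA, ETA)
for fraction in (0.001, 0.5, 1.0, 2.0):
    print(fraction, critical_cluster(-fraction * Z, C_EL, DELTA, GAMMA, ETA))
```

With these numbers the critical strain rises from nearly zero at small driving force to the full misfit δ once |ΔG_v| reaches Z, in line with the two limiting statements above.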

One important consequence of the theory of precipitation is that the
coherent precipitate can form and grow only at temperatures well below
the temperature at which the completely non-coherent precipitate is in
equilibrium with the solid solution. This consequence was recognized by
NABARRO⁶⁹ and BARRETT⁸⁹ and has been treated in more detail by LESCHEN
and FISHER.⁹⁰ Following FISHER, HOLLOMON and LESCHEN⁸⁸ the free energy
relations between solid solution, α, and coherent, β_c, and
non-coherent, β, phases are shown schematically in Fig. 19. The free
energy of β_c at a given x exceeds that of β by an amount cδ². The
temperature at which β_c is in equilibrium with α is ΔT lower than the
temperature at which α and β are in equilibrium.

Fig. 19. Comparison of free energy-composition dependence of coherent
and non-coherent β phases. After FISHER, HOLLOMON and LESCHEN⁸⁸

As pointed out by NABARRO, where δ is relatively large, e.g. > 0.10, it
is sometimes not thermodynamically possible to form a completely
coherent precipitate.

The problem becomes additionally complex when the cases of rigid 
and semi-rigid precipitates are taken into account. Then the shape and 
energy of the particle will depend on the relative elastic moduli and the 
anisotropy of the surface energy. 

Comparison of Experience and Theory 

To the best of the authors' knowledge there have been few quantitative
measurements of the nucleation frequency of precipitate from solid
solutions. However, some generalizations can be made from qualitative
observations. GEISLER⁹¹, ⁹² has emphasized that the vicinity of grain
boundaries and slip bands are preferred sites for the nucleation of
precipitate when the supersaturation is not large. Thus non-equilibrated
structural defects, such as grain boundaries and possibly dislocations,
apparently catalyze nucleation of precipitate. The preference of nuclei
for grain boundary sites is understood, since grain boundary energy is
released (see equation (61)) by the formation of a cluster thereon. It
is not known to what extent nucleation of precipitate is catalyzed by
accidental inclusions.⁹²

A second important generalization is that there exists always a 
definite crystallographic orientation relation between the precipitate 
particles and the matrix. In general, this relation is that the planes and 
directions in which the atomic packing is most similar are parallel in the 
two lattices. For example, in the precipitation of a face-centred cubic 
precipitate from a face-centred cubic matrix the cube edges and cube 
planes in the two lattices are usually parallel. This result might be 
explained either on the basis that only the particles with the preferred 
orientations nucleate or grow rapidly. However, on the basis of the 
theory that has been developed there is good reason for expecting the 
nucleation of preferred orientations.* 

From the dislocation theory of interphase boundaries it is expected that
their interfacial energy will be a minimum when δ is a minimum.
Therefore, on the basis of surface energy considerations only, clusters
having an orientation relation with the matrix that minimizes δ will
form preferentially. The observed orientation relationships for
precipitation are in general consistent with the principle that δ be a
minimum for low index planes.

It seems probable that the dominant factor controlling the orientation 
relation of an incoherent nucleus with the matrix is surface energy and 
that strain energy will have little effect. This conclusion follows from 
the NABARRO theory that the major part of the strain is generally taken 
up by the matrix when incoherent clusters are formed. However, when 
the nucleus is partly coherent with the matrix it appears that the 
orientation relation will be controlled by the relation of the elastic 
coefficients and interfacial energy to crystal direction. For if the 
coherency strain is taken up by the cluster it is evident that the 
coherency plane relation corresponding to the minimum c will minimize 
the strain energy. The orientation relation that minimizes the strain 
energy of the critical cluster does not necessarily minimize the interfacial 
energy. 

* It is certain that an "oriented growth" theory cannot account for
preferred orientations of precipitates in a cubic matrix when the growth
is diffusion controlled.

In summary, the problem of nucleation of precipitates from solid
solution is very complex and the most useful approximations have not
yet been established. Even so, the general nature of such precipitation 
can be described. The precipitation process that will be observed at any 
temperature below the solvus will be the one that occurs at the maxi- 
mum rate at that temperature. At temperatures just below the solvus 
the precipitation will occur heterogeneously and incoherently. If the 
temperature is further lowered, the rate of overall precipitation can be 
increased either by an increase in the rate of nucleation or by an increase 
in the rate of growth. Such an increase can be accomplished by adjust- 
ing the composition or structure to minimize the surface energy between 
precipitate and matrix or by adjusting the composition and structure 
to allow a more rapid rate of growth of the precipitate. One way in 
which the surface energy is reduced is by the formation of coherent 
precipitates which can occur only at temperatures sufficiently below the 
solvus to make the free energy available for straining the precipitate. 
Thus at temperatures below that at which incoherent precipitation 
occurs coherent homogeneous nucleation will occur at a more rapid rate. 
At still lower temperatures, the rate of overall transformation can be 
further increased by allowing the precipitate to form with the same 
composition as the matrix thus eliminating the necessity for long-range 
diffusion and forming a coherent particle in such a way that transport 
of atoms across the interface can occur rapidly. This latter type of 
transformation has been given the name martensite. Thus three distinct
types of precipitation are possible: first, coherently nucleated;
second, incoherently nucleated; and third, martensitic. There are obviously
intermediate types, for example, when the precipitation occurs partially 
coherently. 

There is much valuable data on orientation relations in precipitation 
that can be qualitatively explained. Also there is reliable information on 
the coherency relations in many systems. Unfortunately, there are few 
data on the isothermal course of precipitation as a function of time and 
even less quantitative data on the nucleation frequency in precipitation 
from solid solution. There is a great need for such data to test the 
prediction of current theories of nucleation of precipitate and to aid 
their further development. 
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Pole mechanism of deformation twinning 

223 

Polygon i /at ion mechanism 251, 252 
Pore size of sintered metals 181-3 
Praseodymium nitride, structure 109 
Precipitation, thermodynamics 359-61, 

364 
Pressure of non -metals, effect on reaction 

with metals 79-82, 96 

of oxygen, effect on electrical con- 

ductivity 74, 75 

Purity of motal, effect on oxidation rate 
80, 88 

RATIONAL rate constant 77, 81 
Reaction rate theory 295, 296 
Recovery, heat of activation 52 

theory 252-9 

Reery stall i/.atioii 255-60, 277 
References 67, 102, 167, 203, 260, 330, 
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Relaxation behaviour of metals 1 1 7 c t scq 

strength 15-19, 35, 36, 121 

times, superposition 19, 20 
Relaxed modulus 17 
Resonance curve, equation 9 
Rhodium, supercooling 349 
Ring diffusion 301, 302 
mechanism 308, 309 

SCANDIUM nitride, structure 109 
Screw dislocations 208, 212, 235 
SCHOTTICY mechanism of diffusion 74, 75 
Self -diffusion and chemical diffusion, 
interconnections 322, 323 

anelastic methods of measurement 

316-20 

coefficients, estimation 310 



Self-diffusion (contd.) 

in metals 284-6, 292, 315 

in oxides 81 

mechanism 305-10 

sintering method of measurement 3 1 5,> 

316 

Semi-coherent precipitation 371 
Semi-conductors, defects in 74, 75 

diffusion in 74, 95 

electrical conductivity 75 

electrical conductivity, effect of alloy- 

ing elements 85 et seq 
Sessile dislocation 220, 249 
Silver, creep 199 

diffusion in 320 
-electrical conductivity 237 

reaction with bromine 78, 83 

self-diffusion 197, 198, 284-6, 303-16 

sintering 174-83, 197 

solubility of oxygen in 111 

surface self-diffusion 294 

surface tension 187-9, 356, 357 

water vapour embrittlement 162, 163 

work hardening 245 

Silver alloys, attack by halogens 83-5 

diffusion in 318, 322 

effect of dissolved gases on electrical 

conductivity 144, 145 

internal friction 43 

internal oxidation 154-6 

Kirkendall effect 270-3 

relaxation effect 40 

relaxation time 44 

surface self-diffusion 294 

vacancy mechanism 45 

Silver halides, formation on silver 78, 83 

internal stresses 228 

solubility of ions 85 et seq 

Sintering, effect of stress 186 

mechanism 175 et seq 

method for measuring self-diffusion 

coefficients 315, 316 

surface diffusion 196, 197, 199 
volume diffusion 196 et seq 
Slip 259, 260 

bands, dislocations 228-30 

in body centred cubic crystals 221, 222 
in close packed hexagonal crystals 221 

in face centred cubic crystals 218-21 
Sodium chloride, lattice defects 74 

work hardening 247 
Solidification, size effect 349 

of droplets coated with films 352, 353 

of small droplets 349-55 
Solubility in metals 

hydrogen 112, 113 

nitrogen 107-10 

of ions in nickel oxide 91 

of ions in silver halides 85 et seq 

oxygen 110-12 

Specific heat of metals containing gases 

147 

Sphere of relaxation 47 
Spinels, formation on metals on oxidation 

92-4 
Standard linear solid 15-19, 31, 45, 47 
Static hysteresis 22-4 

Steel, effect of hydrogen on mechanical 

properties 165, 166 
Strain ageing 240-3 

effect of temperature 242 

energy 306 

rate 190 

Stress induced ordering, source of internal 
friction 30 et seq 

relaxation across incoherent interfaces 

46 et seq 

Structure of nitrides 109 

Subscale 151 et seq 

Substitutional solutions, anelastic behaviour 41-3 

diffusion in 300 

stress-induced ordering 40 et seq 

Supercooled liquids, crystallization 348 

Supercooling of metals, dependence on 
crystal structure 357, 358 

Superheating of solids 359 

Supraconductive transition of tantalum 
and niobium 142 

Surface diffusion 280 et seq, 316 

and sintering 196-9 

energies, liquid-crystal 356 
liquid-vapour 341, 342 

energies of solids, determination 187 

self-diffusion of silver 294 

TANTALUM, diffusion of carbon in 40 

effect of carbon on relaxation phenomena 36 

effect of dissolved gases on relaxation 

phenomena 36 

effect of dissolved gases on electrical 

conductivity 141, 142 

relaxation phenomena 36, 118-21 

sintering 179 

solubility of nitrogen in 108, 109 

solubility of oxygen in 120 
Tantalum alloys, diffusion in 298, 319 
Tarnishing reactions 71 et seq 
Tensile strength, sintered metals 185 

titanium containing gases 149 

Thermal diffusion 61, 62 
Thermoelastic internal friction 60-2 
Thin film formation 97 

Thorium, effect of gases on photoelectric 
activity 126 

solubility of nitrogen in 108, 109 
Tin, liquid-crystal surface energy 356 
self-diffusion 292 

solidification of small droplets 354 

supercooling 349, 357 
Tin alloys, nucleation 365-7 
Titanium, effect of gases on electrical conductivity 143 

effect of gases on lattice spacing 130, 

131 

effect of gases on mechanical properties 

148, 149 

effect of gases on photoelectric activity 

125, 126 

reactions with gases 117 

solubility of nitrogen in 108, 109 




solubility of oxygen in 112 
Titanium alloys, mechanism of oxidation 

88, 89 

effect of gases on mechanical properties 149 

Titanium dioxide, effect of additions on 
electrical conductivity 88, 89 

defect structure 80, 88, 89 

sintering 184 

Titanium-hydrogen, magnetic properties 
134 

Titanium nitride, structure 109 

Torsion pendulum 25, 26 

Transference in metallic solutions 146 

Transformation stresses in nuclei 368, 
369 

Transient nucleation 338, 339 

Tungsten, self-diffusion 308 

sintering 177, 178 

solubility of nitrogen in 108, 109 

surface self-diffusion 294 
Tungsten alloys, diffusion in 199 
Twin interfaces, internal friction 49 



UNIT dislocations 216-18 
Unrelaxed modulus 17 
Uranium, solubility of nitrogen in 108, 109 



VACANCIES, annihilation 276-8 

equilibrium concentration 306 

generation 236-8, 275 

Vacancy mechanism of diffusion 45, 
274-8, 300, 303-5, 308 

Vanadium, solubility of nitrogen in 108, 
109 

Vanadium nitride, structure 109 

Vibrational energy, fractional decrease per cycle 11 

Viscoelasticity 15, 21, 22, 55 

Viscous flow of solids 188 et seq 

Viscosity equation 6 

Voigt solid, internal friction 5 

WATER, liquid-crystal surface energy 
356, 357 

solidification of small droplets 355 

supercooling 348, 349 

surface energies of droplets 341, 342 

vapour, embrittlement of metals by 

158-64 

Wolfram (see Tungsten) 

Work-function of metals, effect of gases 
124-6 

Work hardening, dependence on temperature 251 

theory 243 et seq 

of cubic metals 244-8 

of hexagonal metals 244-8 



YIELD point 238-42 

dependence on temperature 240 

of metals 150 

theory 241 

Young's modulus 34, 63, 65 
ZINC, internal friction of annealed and cold worked single crystals 53 
mechanism of oxidation 79, 98-101 
sintering 174, 175 
work hardening 244 
yield point 150, 241 
Zinc alloys, mechanism of oxidation 86-8 
Zinc oxide, diffusion in 98 
electrical conductivity 75, 79 
effect of alloying elements on electrical conductivity 85, 86 
lattice defects 75 
sintering 184 
Zirconium, effect of gases on photoelectric activity 124, 125 
solubility of nitrogen in 108, 109 
solubility of oxygen in 112 
Zirconium-hydrogen, magnetic properties 134 
Zirconium nitride, structure 109 
Zirconium-oxygen solutions 147 






